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5 

ACTIVATION OF HCV-SPECDIC T CELLS 

TECHNICAL AREA OF THE INVENTION 

1 0 The invention relates to the activation of hepatitis C virus(HCV>specific T 

cells. More particularly, the invention relates to the use of multiple HCV 
polypeptides, either alone or as fusions, to stimulate cell-mediated immune responses, 
such as to activate HCV-specific T cells. 

15 BACKGROUND OF THE INVENTION 

Hepatitis C virus (HCV) infection is an important health problein with 
approximately 1% of the world's population infected with the virus. Ovct 75% of 
acutely infected individuals eventually progress to a chronic carrier state that can 
result in cirrhosis, liver failure, and hepatocellular carcinoma See Alter et al (1992) 

20 N. Engl. J. Med. 327: 1 899-1 905; Resnick and Koff. (1 993) Arch. Intern, Med. 

153:1672-1677; Seeff (1995) Gastrointest. Dis. 6:20-27; Tong etal (1995) N. Engl. J. 
Med 332:1463-1466. 

Despite extensive advances in the development of pharmaceuticals against 
certam viruses like HIV, control of acute and chronic HCV infection has had limited 

25 success (Hoofiiagle and di Bisceglie (1997) N. Engl. J. Med. 336:347-356). In 

particular, generation of a strong cytotoxic T lymphocyte (CTL) response is tfiought 
to be important for the control and eradication of HCV infections. Thus, there is a 
need in the art for effective methods of inducing strong CTL responses against HCV. 
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SUMMARY OF THE INVENTION 

It is an object of the invention to provide reagents and methods for stimulating 
immune responses, such as activating T cells which recognize epitopes of HCV 
polypq)tides. This and other objects of the invention are provided by one or more of 
5 the embodiments described below. 

The mvention provides HCV proteins useful for stimulating immune 
responses, such as activating HCV-specific T cells. One embodiment provides a 
fiision protein that comprises HCV polypeptides, wherein the HCV polypeptides 
consist essentially of anNS3, anNS4, anNSSa polypeptide, and optionally a core 
1 0 polypeptide. In certain embodiments, the fusion protein includes an NS5b 
polypeptide. 

In certain embodiments, at least one of the HCV polypeptides is derived from 
a different strain of HCV than the other polypeptides. 

The invention also provides compositions comprising any of these fusion 

1 S proteins and a pharmaceutically acceptable excipient. In certain embodiments, the 
compositions further comprise an adjuvant, a CpG polynucleotide and/or the fusion 
protein is adsorbed to or entrapped within a microparticle or ISCOM. The 
compositions can further comprise a polynucleotide encoding an E1E2 complex. The 
B1E2 polynucleotide can also be adsorbed to or entrapped withing a microparticle. 

20 Another embodiment provides a composition comprising HCV polypeptides 

and a pharmaceutically acceptable excipient. The HCV polypeptides consist 
essentially of anNS3, anNS4, an NS5a polypeptide, and optionally a core 
polypeptide. In certain embodiments, the composition includes an NS5b polypeptide. 
In other embodiments, the compositions further comprise an adjuvant, a CpG 

25 polynucleotide and/or one or more of the HCV polypeptides is adsorbed to or 

entrapped within a microparticle or ISCOM. The compositions can further comprise a 
polynucleotide encoding an E1E2 complex. The E1E2 polynucleotide can also be 
adsorbed to or entrapped withing a microparticle. 

Moreover, one of the HCV polypeptides may be derived from a different strain of 
30 HCV than the others. 
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Even another embodiment of the mvention provides an isolated and purified 
polynucleotide which encodes a fusion protein as described above. In additional 
embodiments, the fusion proteins further include a polynucleotide encoding an E1E2 
complex. 

5 Yet another embodiment of the invention provides a composition comprising 

the polynucleotides described above and a pharmaceutically acceptable excipient In 
certain embodiments, the compositions further comprise an adjuvant and/or the 
polynucleotide may be adsorbed to or entrapped within a microparticle. The 
compositions can further comprise a polynucleotide encoding an B1E2 complex. The 

10 E1E2 polynucleotide can also be adsorbed to or entrapped withing a microparticle. 
In a further embodiment, the invention provides a composition comprising 
HCV polynucleotides and a phannaceutically acceptable excipient, wherein the HCV 
polynucleotides consist essentially of polynucleotides encodmg an NS3, an NS4, an 
NS5a polypeptide, and optionally a core polypeptide. In certain embodiments, the 

1 5 composition also includes a polynucleotide encoding an NS5b polypeptide. The 
compositions may further comprise an adjuvant and/or one or more of the 
polynucleotides may be adsorbed to or entrapped within a microparticle. The 
compositions can further comprise a polynucleotide encoding an E1E2 complex. The 
E1E2 polynucleotide can also be adsorbed to or entrapped withing a microparticle. 

20 Additionally, one or more of the polynucleotides may be derived from a different 
strain of HCV than the others. 

In another embodiment, the invention provides a method of activating T cells 
which recognize an epitope of an HCV polypeptide. T cells are contacted with any of 
the fusions, polynucleotides or compositions described above, A population of 

25 activated T cells recognizes an epitope of the NS3, NS4, NS5a, NS5b, core and/or 
E1E2 polypeptide. 

In the proteins and polynucleotides above, the regions in the fusions need not 
be in the order in which they naturally occur in the native HCV polyprotein. Thus, for 
example, the NS5b polypeptide, if present, may be at the N- and/or C-terminus of the 
30 fusion, or may be located internally. Similarly, the El polypeptide may precede or 
follow the E2 polypeptide. The E1E2 polypeptide may also be part of the 
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nonstructural fusion protein or may be provided separately, as an E1E2 complex, or as 

individual polypeptides. 

Moreover, the NS3 polypeptide may include a modification to inhibit protease 

activity, such that cleavage of the fusion is inhibited. Such modifications are 
5 described more fiilly below. Additionally, the compositions can comprise more than 

one HCV nonstructural fusion protein, such as a fusion protein with NS3, NS4 and 

NS5a, and a fusion protein with NS3, NS4, NS5a, NS5b and E1E2. The E1E2 

complexes, whether present separately or as part,of the fusion, can have varying E1E2 

polypeptides (described more fully below). 
10 In certain embodiments, the nonstructural fusion protein consists of, from the 

amino terminus to the carboxyl terminus, an NS3, an NS4, an NS5a and, optionally, 

an NS5b polypeptide and the E1E2 complex consists of, from amino terminus to the 

carboxyl terminus, an El polypeptide and an E2 polypeptide. 

The various polypeptides (and polynucleotides encoding therefor) are derived 
1 5 from the same HCV isolate, or from different strains and isolates including isolates 

having any of the various HCV genotypes, to provide increased protection against a 

broad range of HCV genotypes. 

Yet another embodiment of the invention provides a method of stimulating an 

immune response, such as a cellular immune response, in a vertebrate subject by 
20 administering a composition as described herein. In certain embodiments, the 

composition activates T cells which recognize an epitope of an HCV polypeptide. T 

cells are contacted with a composition as described above. A population of activated 

T cells recognizes an epitope of one or more of the HCV polypeptide(s). 

The invention thus provides methods and reagents for stimulating immune 
25 responses to HCV, such as for activating T cells which recognize epitopes of HCV 

polypeptides. These methods and reagents are particularly advantageous for 

identifying epitopes of HCV polypeptides associated with a strong CTL response and 

for immunizing mammals, including humans, against HCV. 
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BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a diagraimnatic representation of the HCV genome, depicting the 
various regions of the HCV polyprotein. 

Figure 2 (SEQ ID NOS: 9 and 10) depicts the DNA and corresponding amino 
5 acid sequence of a representative native NS3 protease domain. 

Figures 3A-3C (SEQ ID N0S:3 and 4) shows the nucleotide and 
corresponding amino acid sequence for the HCV-1 El/E2/p7 region. The numbers 
shown in the figure are relative to the full-length HCV-1 polyprotein. The El, E2 and 
p7 regions are shown. 
10 Figure 4 is a diagram of plasmid pMHElE2-809, encoding E1E2809, a 

representative E1E2 protein for use with the present invention. 

Figures 5A-5J (SEQ ID N0S:7 and 8) depict the DNA and corresponding 
amino acid sequence of a representative NS345Core fusion protein. The depicted 
sequmce includes amino acids 1242-301 1 of the HCV polyprotein (representing 
15 polypeptides firom NS3, NS4, NS5a and NS5b) with amino acids 1-121 of the HCV 
polyprotein (representing a polypeptide fix>m the core region) fused to the C-terminus 
of NS5b. This numbering is relative to the HCV-1 polyprotein. 

Figure 6 shows a side-by-side comparison of IFN-y expression generated in 
animals in response to delivery of alphavims constructs encoding NS3NS4NS5a. 
20 Figure 7 shows IFN-y expression generated in animals in response to delivery 

of plasmid DNA encoding NS3NS4NS5a ("naked"), PLG-hnked DNA encoding 
NS'3NS4NS5a C'PLG), separate DNA plasmids encoding NS5a, NS34a, and NS4ab 
("naked"), and PLG-linked DNA encoding NS5a, NS34a, and NS4ab ("PLG"). 

Figure 8 shows HCV-specific CD8+ and CD4-i- responses in vaccinated 
25 chimpanzees. 

Figure 9 depicts the sfpecificity of T cell responses primed by electroporation 
of plasmid DNA two weeks subsequent to the third immunization. 

Figure 10 shows the specificity of T cell responses primed by vaccinating 
chimpanzees with NS345Corei2i-ISCOMS two weeks subsequent to the third 
30 immunization. 
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DETAILED DESCRIPTION OF THE INVENTION 

The practice of the present invention will employ, unless otherwise indicated, 
conventional methods of chemistry, biochemistry, recombinant DNA techniques and 
immunology, within the skill of the art Such techniques are e>q)lained fully in the 
5 literature. See, e.g., Sambrook, et al., Molecular Cloning: A Laboratory Manual (2nd 
Edition); Methods In Enzymology (S. Colowick and N, Kaplan eds.. Academic Press, 
Inc.); DNA Cloning, Vols. I and II (D.N. Glover ed.); Oligonucleotide Synthesis (M.J. 
Gait ed.); Nucleic Acid Hybridization (B.D. Haines & S.J. Higgins eds.); Animal Cell 
Culture (R-K. Freshney ed.); Perbal, B., A Practical Guide to Molecular Cloning, 
10 It must be noted that, as used in this specification and the appended claims, the 

singular forms "a", "an" and "the" include plural referents unless the content clearly 
dictates otherwise. Thus, for example, reference to "an antigen" inchxdes a mixture of 
two or more antigens, and the like. 

The following amino acid abbreviations are used throughout the text: 
15 Alanine: Ala (A) Arginine: Arg (R) 

Asparagine: Asn (N) Aspartic acid: Asp (D) 

Cysteine: Cys (C) Glutamine: Gin (Q) 

Glutamic acid: Glu (E) Glycine: Gly (G) 
Histidine: His (H) Isoleucine: lie (I) 

20 Leucine: Leu (L) Lysine: Lys (K) 

Methionine: Met (M) Phenylalanine: Phe (F) 

ProUne: Pro (P) Serine: Ser (S) 

Threonine: Thr (T) Tryptophan: Trp (W) 

Tyrosine: Tyr (Y) Valine: Val (V) 

25 

L Definitions 

In desCTibing the present invention, the following terms will be employed, and 
are intended to be defined as mdicated below. 

The terms "polypeptide" and "protein" refer to a polymer of amino acid 
30 residues and are not limited to a minimum length of the product. Thus, peptides, 
oligopeptides, dimers, multimers, and the like, are included within the definition. 
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Both full-length proteins and fragments thereof are encompassed by the definition. 
The terms also include postexpression modifications of the polypeptide, for example, 
glycosylation, acetylation, phosphorylation and the like. Furthermore, for purposes of 
the present invention, a ^^polypeptide" refers to a protein which includes 
5 modifications, such as deletions, additions and substitutions (generally conservative in 
nature), to the native sequence, so long as the protein maintains the desired activity. 
These modifications may be deliberate, as through site-directed mutagenesis, or may 
be accidental, such as through mutations of hosts which produce the proteins or errors 
due to PGR amplification. 

10 An HCV polypeptide is a polypeptide, as defined above, derived from the 

HCV polyprotein. The polypeptide need not be physically derived from HCV, but 
may be synthetically or recombinantly produced. Moreov^, the polypeptide may be 
derived from any of the various HCV strains and isolates including isolates having any 
of the 6 genotypes of HCV described in Simmonds et al., J. Gen. Virol (1993) 

15 74:2391-2399 (e.g., strains 1, 2, 3, 4 etc.), as well as newly identified isolates, and 
subtypes of these isolates, such as HCVla, HCVlb, etc. A number of conserved and 
variable regions are known between these strains and, in general, the amino acid 
sequences of epitopes derived from these regions will have a high degree of sequence 
homology, e.g., amino acid sequence homology of more than 30%, preferably more 

20 than 40%, when the two sequences are aligned. Thus, for example, the term 'TSrS4" 
polypeptide refers to native NS4 from any of the various HCV strains, as well as NS4 
analogs, muteins and immunogenic fragments, as defined fiarther below. 

By an "El polypeptide" is meant a molecule derived from an HCV El region. 
The mature El region of HCV- 1 begins at approximately amino acid 192 of the 

25 polyprotein and continues to approximately amino acid 383, numbered relative to the 
full-length HCV-1 polyprotein. (See, Figures \ and 3A-3C. Amino acids 192-383 of 
Figures 3A-3C correspond to amino acid positions 20-21 1 of SEQ ID N0;4.) Amino 
acids at around 173 through approximately 191 (ammo acids 1-19 of SEQ ED NO: 4) 
serve as a signal sequence for El. Thus, by an "El polypeptide" is meant eitiier a 

30 precursor El protein, including the signal sequence, or a mature El polypeptide which 
lacks this sequence, or even an El polypeptide wifli a heterologous signal sequence. 
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The El polypeptide includes a C-tenninal membrane anchor sequence which occurs at 
approximately amino acid positions 360-383 (see. International Publication No. WO 
96/04301, published February 15, 1996). An El polypeptide, as defined herein, may 
or may not include the C-teiminal anchor sequence or portions thereof. 
5 By an "E2 polypeptide" is meant a molecule derived from an HCV E2 region. 

The mature E2 region of HCV-1 begms at approximately amino acid 383-385, 
numbered relative to the full-length HCV-1 polyprotein. (See, Figures 1 and 3 A-3C. 
Amino acids 383-385 of Figures 3A-3C correspond to amino acid positions 211-213 
of SEQ ID N0:4.) A signal peptide begins at approximately amino acid 364 of the 

10 polyprotein. Thus, by an "E2 polypeptide" is meant either a precursor E2 protein, ' 
including the signal sequence, or a mature E2 polypeptide which lacks this sequence, 
or even an E2 polypeptide with a heterologous signal sequence. The E2 polypeptide 
includes a C-terminal m^brane anchor sequence which occurs at approximately 
amino acid positions 715-730 and may extend as far as approximately amino acid 

15 residue 746 (see, Lin et al., J, Virol (1994) M:5063-5073). An E2 polypeptide, as 
defined herein, may or may not include the C-terminal anchor sequence or portions 
thereof. Moreover, an E2 polypeptide may also include all or a portion of the p7 
region which occurs inmiediately adjacent to the C-terminus of E2. As shown in 
Figures 1 and 3A-3C, the p7 region is found at positions 747-809, numbered relative 

20 to the full-length HCV-1 polyprotein (ammo acid positions 575-637 of SEQ ID 

N0:4). Additionally, it is known that multiple species of HCV E2 exist (Spaete et al., 
Virol (1992)188:819-830; Selbyetal.,^ Virol (1996) 70:5177-5182; Grakouietal., 
1 Virol (1993) 67:1385-1395; Tomeietal,^: Virol (1993)67:4017-4026). 
Accordingly, for purposes of the present invention, the term "E2" encompasses any of 

25 these species of E2 including, without limitation, species that have deletions of 1-20 
or more of flie amino acids from the N-tenninus of the E2, such as, e.g, deletions of 1, 
2, 3, 4, 5....10...15, 16, 17, 18, 19... etc. amino acids. Such E2 species include tiiose 
beginning at amino acid 387, amino acid 402, amino acid 403, etc. 

Representative El and E2 regions J&om HCV-1 are shown in Figures 3A-3C 

30 and SEQ ID N0:4. For purposes of the present invention, the El and E2 regions are 
defined with respect to the amino acid number of the polyprotein encoded by the 
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genome of HC V- 1 , with the initiator methionine being designated position 1 . See, 
e.g., Choo et al., Proc. Natl Acad Sci USA (1991) M:2451-2455. However, it 
should be noted that the term an "El polypeptide" or an "E2 polypeptide" as used 
herein is not limited to the HCV-1 sequence. In this regard, the corresponding El or 
5 E2 regions in other HCV isolates can be readily determined by aligning sequences 
from the isolates in a manner that brings the sequences into maximum aUgnment. 
This can be performed with any of a number of computer software packages, such as 
ALIGN 1.0, available from the University of Vurgima, Department of Biochemistry 
(Attn: Dr. William R. Pearson). See, Pearson et al., Proc, Natl Acad Sci USA (1988) 

10 ^5:2444-2448. 

Furthermore, an "El polypeptide'' or an "E2 polypeptide" as defined herein is 
not limited to a polypeptide having the exact sequence depicted in the Figures. 
Indeed, the HCV genome is in a state of constant flux in vivo and contains several 
variable domains which exhibit relatively high degrees of variability between isolates. 

15 A number of conserved and variable regions are known between these strains and, in 
general, the amino acid sequences of epitopes d^ved from these regions will have a 
high degree of sequence homology, e.g., amino acid sequence homology of more than 
30%, preferably more than 40%, more than 60%, and even more than 80-90% 
homology, when the two sequences are aligned. It is readily apparent that the terms 

20 encompass El and E2 polypeptides from any of the various HCV strains and isolates 
mcluding isolates havmg any of the 6 genotypes of HCV described in Simmonds et 
al., J. GevL Virol (1993) 74:2391-2399 (e.g., strains 1, 2, 3, 4 etc.), as well as newly 
identified isolates, and subtypes of these isolates, such as HCVla, HCVlb etc. 

Thus, for example, the term "El" or "E2" polypeptide refers to native El or E2 

25 sequences from any of the various HCV strains, as well as analogs, muteins and 

immunogenic fragments, as defined fiirther below. The complete genotypes of many 
of these strains are known. See, e.g,, U.S. Patent No. 6,150,087 and GenBank 
Accession Nos. AJ238800 and AJ238799. 

Additionally, the temis "El polypeptide" and "E2 polypeptide" encompass 

30 proteins which include modifications to the native sequence, such as internal 
deletions, additions and substitutions (generally conservative in nature). These 
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modifications may be deliberate, as through site-directed mutagenesis, or may be 
accidental, such as through naturally occurring mutational events. All of these 
modifications are encompassed in the present invention so long as the modified El 
and E2 polypeptides fiinction for their intended purpose. Thus, for example, if the El 
5 and/or E2 polypeptides are to be used in vaccine compositions, the modifications must 
be such that immunological activity (i.e., the abiUty to eUcit a humoral or cellular 
immune response to the polypeptide) is not lost. 

By "E1E2" complex is meant a protein containing at least one El polypeptide 
and at least one E2 polypeptide, as described above. Such a complex may also 

10 include all or a portion of the p7 region which occurs immediately adjacent to the C- 
terminus of E2. As shown in Figures 1 and 3A-3C, the p7 region is found at positions 
747-809, numbered relative to the full-length HCV-1 polyprotein (amino acid 
positions 575-637 of SEQ ID N0:4). A representative E1E2 complex which includes 
the p7 protein is termed "ElE2go5" herein. 

15 The mode of association of El and E2 in an E1E2 complex is immaterial. The 

El and E2 polypeptides may be associated fhrou^ non-covalent interactions such as 
through electrostatic forces, or by covalent bonds. For example, the E1E2 
polypeptides of the presait invention may be in the form of a fusion protein which 
includes an immunogenic El polypeptide and an immunogenic E2 polypeptide, as 

20 defined above. The fusion may be ejqjressed fiiom a polynucleotide encoding an E1E2 
diimera. Alternatively, E1E2 complexes may form spontaneously simply by mixing 
El and E2 proteins vMch have been produced individually. Similarly, when co- 
expressed and secreted into media, the El and E2 proteins can form a complex 
spontaneously. Thus, the term encompasses E1E2 complexes (also called aggregates) 

25 that spontaneously form upon purification of El and/or E2. Such aggregates may 
include one or more El monomers in association with one or more E2 monomers. 
The number of El and E2 monomers present need not be equal so long as at least one 
El monomer and one E2 monomer are present. Detection of the presence of an E1E2 
complex is readily determined using standard protein detection techniques such as 

30 polyaraylamide gel electrophoresis and immunological techniques such as 
immunoprecipitation. 
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The tenns "analog" and "mutein" refer to biologically active derivatives of the 
reference molecule, or ftagments of such derivatives, that retain desired activity, such 
as the ability to stimulate a cell-mediated immune response, as defined below. In 
general, the term "analog** refers to compounds having a native polypeptide sequence 
5 and structure with one or more amino acid additions, substitutions (generally 

conservative in nature) and/or deletions, relative to the native molecule, so long as the 
modifications do not destroy fanmunogenic activity. The term **mutein" refers to 
peptides having one or more peptide mimics ("peptoids"), such as those described in 
International Publication No. WO 91/04282. Preferably, the analog or mutein has at 
10 least the same immunoactivity as the native molecule. Methods for making 

polypeptide analogs and muteins are known in the art and are described further below. 

As explained above, analogs generally mclude substitutions that are 
conservative m nature, i.e., those substitutions that take place within a fiimily of amino 

1 5 acids that are related in their side chains. Specifically, amino acids are generally 
divided mto four femilies: (1) acidic - aspartate and glutamate; (2) basic - lysine, 
arginine, histidine; (3) non-polar - alanme, valine, leucine, isoleucine, proline, 
phenylalanine, methionine, tryptophan; and (4) uncharged polar - glycine, asparagine, 
glutamine, cysteine, serine threonine, tyrosine. Phenylalanine, tryptophan, and 

20 tyrosine are sometimes classified as aromatic amino acids. For example, it is 
reasonably predictable that an isolated replacement of leucine with isoleucine or 
valine, an aspartate with a glutamate, a threonine with a serine, or a similar 
conservative replacement of an amino acid with a structurally related amino acid, will 
not have a major effect on the biological activity. For example, the polypeptide of 

25 interest may include up to about 5-10 conservative or non-conservative amino acid 
substitutions, or even up to about 15-25 conservative or non-conservative amino acid 
substitutions, or any integer between 5-25, so long as the desired fimction of the 
molecule remains intact. One of skill in the art may readily determme regions of the 
molecule of interest that can tolerate change by reference to Hopp/Woods and Kyte- 

30 Doolittle plots, well known in the art. 
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By "modified NS3" is meant an NS3 polypeptide with a modification such that 
protease activity of the NS3 polypeptide is disrupted. The modification can include 
one or more amino acid additions, substitutions (generally non-consorvative in nature) 
and/or deletions, relative to ttie native molecule, wherein the protease activity of the 
5 NS3 polypeptide is disnq)ted. Methods of measuring protease activity are discussed 
further below. 

By "fiagment" is intended a polypeptide consisting of only a part of the intact 
fiill-length polypeptide sequence and structure. The fiagment can include a C- 
tenninal deletion and/or an N-termmal deletion of the native polypeptide. An 

10 *Hmmunogenic firagmenf ' of a particular HCV protein will generally include at least • 
about 5-10 contiguous amino acid residues of the fiilHength molecule, preferably at 
least about 15-25 contiguous amino acid residues of the fiill-length molecule, and 
most preferably at least about 20-50 or more contiguous amino acid residues of the 
fiill-length molecule, that defme an epitope, or any integer between 5 amino acids and 

15 the fiill-length sequence, provided that the firagment in question retains immunogenic 
activity, as measured by the assays described hereiiL 

The term "epitope" as used herem refers to a sequence of at least about 3 to 5, 
preferably about 5 to 10 or 15, and not more than about 1,000 amino acids (or any 
integer therebetween), which define a sequence that by itself or as part of a larger 

20 sequence, binds to an antibody generated m response to such sequence. There is no 
critical upper limit to the Iragth of the fiagment, which may comprise nearly the fiill- 
length of the protein sequence, or even a fiision protem comprising two or more 
epitopes fi-om the HCV polyprotem. An epitope for use in the subject invention is not 
limited to a polypeptide having the exact sequence of the portion of the parent protein 

25 from which it is derived. Indeed, viral genomes are m a state of constant flux and 
contain several variable domains which exhibit relatively high degrees of variability 
between isolates. Thus the term "epitope" encompasses sequences identical to the 
native sequence, as well as modifications to the native sequence, such as deletions, 
additions and substitutions (generally conservative in nature). 

30 Regions of a given polypeptide that include an epitope can be identified using 

any number of epitope mapping techniques, well known in the art. See, e.g.. Epitope 
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Mapping Protocols in Methods in Molecular Biology, Vol. 66 (Glenn E. Morris, Ed, 
1996) Hmnana Press, Totowa, New Jersey. For example, linear epitopes may be 
detewnined by e.g., concurrently synthesizing large numbers of peptides on solid 
supports, the peptides corresponding to portions of the protein molecule, and reacting 
5 the peptides with antibodies while the peptides are still attached to the supports. Such 
techniques are known in the art and desaibed in, e.g., U.S. Patent No. 4,708,871 ; 
Geysenetal. (1984) Proc. NatL Acad. Sci. 81:3998-4002; Geysen et al. 
(1986) M?fec. Immunol 21:709-715. Similarly, conformational epitopes are readily 
identified by determining spatial confoimation of amino acids such as by, e.g., x-ray 

10 crystallography and 2-dimaisional nuclear magnetic resonance. Sec, e.g., Epitope 
Mapping Protocols, supra. Antigenic regions of proteins can also be identified using 
standard antigenicity and hydropathy plots, such as those calculated using, e.g., the 
Omiga version 1 .0 software program available fi-om the Oxford Molecular Group. 
This computer program employs the HoppAVoods method, Hopp et al., Proc. Natl. 

1 5 Acad. Sci (1 98 1 ) 2S:3824-3828 for determining antigenicity profiles, and the 

Kyte-DooUttte technique, Kyte et aL, J. Mol. Biol (1982) 152:105-132 for hydropathy 
plots. 

For a desCTiption of various HCV epitopes, see, e.g., Chien et al., Proc. Natl 
Acad. Sci. USA (1992) 52:1001 1-10015; Chien et al., J. Gastroent. Hepatol (1993) 
20 S:S33-39; Chien et al.. International Publication No. WO 93/00365; Chien, D.Y., 
Ihtemational Publication No. WO 94/01778; and U.S. Patent Nos. 6,280,927 and 
6,150,087. 

As used herein, the term "conformational epitope" refers to a portion of a full- 
length protein, or an analog or mutein th^eof, having structural features native to the 
amino acid sequence encoding the epitope within the full-length natural protein. 
Native structural features include, but are not limited to, glycosylation and three 
dimensional structure. Preferably, a conformational epitope is produced 
recombinantly and is ejcpressed in a cell &om which it is extractable under conditions 
which preserve its desired structural features, e.g. without denaturation of the epitope. 
Such cells include bacteria, yeast, insect, and mammalian cells. Expression and 
isolation of recombinant conformational epitopes fix)m the HCV polyprotein are 
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described in e.g., International Publication Nos. WO 96/04301, WO 94/01778, WO 
95/33053, WO 92/08734. 

As used herein the term "T-cell epitope" refers to a feature of a peptide 
structure which is capable of inducing T-cell immunity towards the peptide structure 
5 or an associated hapten. T-cell epitopes generally comprise linear peptide 

determinants that assume extended conformations within the peptide-binding cleft of 
MHC molecules, (Unanue et al., Science (1987) 236:551-557). Conversion of 
polypeptides to MHC class U-associated linear peptide determinants (generally 
between 5-14 aixuno acids in length) is termed "antigen processing" which is carried 

10 out by antigen presenting cells (APCs). More particularly, a T-cell epitope is defined 
by local features of a short peptide structure, such as primary amino acid sequence 
properties involving charge and hydrophobicity, and certain types of secondary 
structure, such as helicity, that do not depend on the folding of the entire polypeptide. 
Further, it is believed that short peptides capable of recognition by helper T-cells are 

1 5 generally amphipathic structures comprising a hydrophobic side (for interaction with 
the MHC molecule) and a hydrophilic side (for interacting with the T-cell receptor), 
(Margalit et al.. Computer Prediction of T-cell Epitopes, New Generation Vaccines 
Marcel-Dekker, Inc, ed G.C. Woodrow et al., (1990) pp. 109-1 16) and further that tiie 
amphipathic structures have an a-helical conJBguration (see, e.g., Spouge et al., J, 

20 Immunol (1987) 1M:204-212; Berkower et al., J, Immunol (1986) 136:2498-2503). 
Hence, segments of proteins that include T-cell epitopes can be readily 
predicted using numerous computer programs. (See e.g., Margalit et al.. Computer 
Prediction of T-cell Epitopes, New Generation Vaccines Marcel-Dekker, Inc, ed. G.C. 
Woodrow et al., (1990) pp. 109-1 16). Such programs generally compare the amino 

25 acid sequence of a peptide to sequences known to induce a T-cell response, and search 
for patterns of amino acids which are believed to be required for a T-cell epitope. 

An "immunological response" to an HCV antigen (mcluding both polypeptide 
and polynucleotides encoding polypeptides that are expressed in vivo) or composition 
is the development in a subject of a humoral and/or a cellular immune response to 

30 molecules present in the composition of interest For purposes of the present 

invention, a '^humoral immune response" refers to an inmiune response mediated by 
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antibody molecules, while a "cellular immune response" is one mediated by T- 
lymphocytes and/or other white blood cells. One important aspect of cellular 
immunity involves an antigen-specific response by cytolytic T-cells ("CTLs"). CTLs 
have specificity for peptide antigens that are presented in association with proteins 
5 encoded by the major histocompatibility complex (MHC) and expressed on the 
surfaces of cells. CTLs help induce and promote the intracellular destruction of 
intracellular microbes, or the lysis of cells infected with such microbes. Another 
aspect of cellular inununity involves an antigen-specific response by helper T-cells. 
Helper T-cells act to help stimulate the function, and focus the activity of, nonspecific 
10 effector cells against cells displaying peptide antigens in association with MHC 
molecules on their surface. A "cellular immune response" also refers to the 
production of cytokines, chemokines and other such molecules produced by activated 
T-cells and/or other white blood cells, including those derived firom CD4+ and CD8+ 
T-cells. 

15 A composition or vaccine that elicits a cellular immune response may serve to 

sensitize a vertebrate subject by the presentation of antigen in association with MHC 
molecules at the cell surface. The cell-mediated immune response is directed at, or 
near, cells presenting antigen at their sur&ce. In addition, antigen-specific T- 
lymphocytes can be gaierated to allow for the fixture protection of an unmunized host. 

20 

The ability of a particular antigen to stimulate a cell-mediated immunological 
response may be determined by a number of assays, such as by lymphoproliferation 
(lymphocyte activation) assays, CTL cytotoxic cell assays, or by assaying for T- 
lymphocytes specific for the antigen in a sensitized subject. Such assays are well 

25 known in the art. See, e.g., Erickson et al,, J. Immunol (1993) 151 :4189-4199; Doe et 
al., Eur. J, Immunol (1994) 24:2369-2376; and the examples below. 

Thus, an immunological response as used herein may be one \sduch stimulates 
the production of CTLs, and/or the production or activation of helper T- cells. The 
antigen of interest may also elicit an antibody-mediated immune response. Hence, an 

30 inununological response may include one or more of the following effects: the 

production of antibodies by B-cells; and/or the activation of suppressor T-cells and/or 
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y8 T-cells directed specifically to an antigen or antigens present in the composition or 
vaccine of interest. These responses may serve to neutralize infectiyity, and/or 
mediate antibody-complement, or antibody dependent cell cytotoxicity (ADCC) to 
provide protection or alleviation of symptoms to an immimized host Such responses 
5 can be determined using standard inmiunoassays and neutralization assays, well 
known in the art. 

By "equivalent antigenic determinant" is meant an antigenic determinant from 
different sub-species or strains of HCV, such as fiom strains 1, 2, 3, etc., of HCV 
which antigenic determinants are not necessarily identical due to sequence variation, 

10 but which occur in equivalent positions in the HCV sequence in question. In general 
the amino acid sequences of equivalent antigenic determinants will have a high degree 
of sequence homology, e.g., amino acid sequence homology of more than 30%, 
usually more than 40%, such as more than 60%, and even more than 80-90% 
homology, when the two sequences are aligned, 

1 5 A "coding sequence" or a sequence which "encodes" a selected polypeptide, is 

a nucleic acid molecule which is transcribed (in the case of DNA) and translated (in 
the case of mKNA) into a polypeptide in vitro or in vivo vAxcn placed under flie 
control of appropriate regulatory sequences. The boundaries of the coding sequence 
are determined by a start codon at the 5' (amino) terminus and a translation stop codon 

20 at the 3* (carboxy) terminus. A transcription termination sequence may be located 3' 
to the coding sequence. 

A "nucleic acid" molecule or "polynucleotide" can include both double- and 
single-stranded sequences and refers to, but is not lunited to, cDNA from vbal, 
procaryotic or eucaryotic mRNA, genomic DNA sequences from viral (e.g. DNA 

25 viruses and retroviruses) or procaryotic DNA, and especially synthetic DNA 

sequences. The term also captures sequences that include any of the known base 
analogs of DNA and RNA. 

An "HCV polynucleotide" is a polynucleotide that encodes an HCV 
polypeptide, as defmed above. 

30 "Operably linked" refers to an arrangement of elements wherein the 

components so described are configured so as to perform their desired function. Thus, 
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a given promoter operably linked to a coding sequence is capable of effecting the 
expression of the coding sequence when the proper transcription fectors, etc., are 
present The promoter need not be contiguous with the coding sequence, so long as it 
functions to direct the expression thereof. Thus, for example, intervening untranslated 
5 yet transcribed sequences can be present between the promoter sequence and the 
coding sequence, as can transcribed inttons, and the promoter sequence can still be 
considered "operably linked" to the codmg sequence. 

"Recombinant as used herein to describe a nucleic acid molecule means a 
polynucleotide of genomic, cDNA, viral, semisynthetic, or synthetic origin which, by 

10 virtue of its origin or manipulation is not associated with all or a portion of the 

polynucleotide with which it is associated in nature. The term "recombinant" as used 
with respect to a protein or polypeptide means a polypeptide produced by expression 
of a recombinant polynucleotide. In general, the gene of interest is cloned and then 
expressed in transformed organisms, as described furttier below. The host organism 

1 5 expresses the foreign gene to produce the protein under expression conditions. 

A "control element" refers to a polynucleotide sequence which aids in the 
expression of a coding sequence to which it is linked. The term includes promoters, 
transcription termination sequences, upstream regulatory domains, polyadenylation 
signals, untranslated regions, mcluding 5'-UTRs and 3'-UTRs and when appropriate, 

20 leader sequences and enhancers, vAach collectively provide for the transcription and 
translation of a coding sequrace in a host cell. 

A ^'promoter" as used herein is a DNA regulatory region capable of binding 
RNA polymerase in a host cell and mitiating transcription of a downstream (3* 
direction) coding sequence operably linked thereto. For purposes of the present 

25 invention, a promoter sequence includes the minimum number of bases or elements 
necessary to mitiate transcription of a gene of interest at levels detectable above 
background. Within the promoter sequence is a transcription initiation site, as well as 
protein binding domains (consensus sequences) responsible for the binding of RNA 
polymerase. Eucaryotic promoters will often, but not always, contain "TATA" boxes 

30 and "CAT" boxes. 
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A control sequence "directs the transcription" of a coding sequence in a cell 
when RNA polymerase will bind the promoter sequence and transcribe the coding 
sequence into mRNA, which is then translated into the polypeptide encoded by the 
coding sequence. 

5 "Expression cassette" or "expression construct" refers to an assembly which is 

capable of directing the expression of the sequence(s) or gene(s) of interest. The 
expression cassette mcludes control elements, as described above, such as a promoter 
which is operably linked to (so as to direct transcription of) the sequence(s) or gene(s) 
of interest, and often includes a polyadenylation sequence as well. Within certam 

10 embodiments of the invention, the expression cassette described herein may be 

contained within a plasmid construct. In addition to the components of the expression 
cassette, the plasmid construct may also include, one or more selectable markers, a 
signal which allows the plasmid construct to exist as single-stranded DNA (e.g., a 
M13 origin of replication), at least one multiple cloning site, and a "manunalian" 

15 origin of replication (e.g., a SV40 or adenovirus origin of replication). 

"Transformation," as used herein, refers to the insertion of an exogenous 
polynucleotide mto a host cell, irrespective of the method used for insertion: for 
example, transformation by direct uptake, transfection, infection, and the like. For 
particular methods of transfection, see further below. The exogenous polynucleotide 

20 may be maintained as a nonint^;rated vector, for example, an episome, or 
alternatively, may be integrated into the host genome. 

A 'liost cell" is a cell which has been transformed, or is capable of 
transformation, by an exogenous DNA sequence. 

By "isolated" is meant, when referring to a polypeptide, that the indicated 

25 molecule is separate and discrete from the vvliole organism with which the molecule is 
found in nature or is present in the substantial absence of other biological macro- 
molecules of the same type. The term "isolated" with respect to a polynucleotide is a 
nucleic acid molecule devoid, in whole or part, of sequences normally associated with 
it in nature; or a sequence, as it exists in nature, but having heterologous sequences in 

30 , association therewith; or a molecule disassociated from the chromosome. 
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The tenn "purified" as used herein preferably means at least 75% by weight, 
more preferably at least 85% by weight, more preferably still at least 95% by weight, 
and most preferably at le^ 98% by weight, of biological macromolecules of the same 
type are present 

5 "Homology" refers to the percent identity between two polynucleotide or two 

polypeptide moieties. Two DNA, or two polypeptide sequences are "substantially 
homologous" to each other when the sequences exhibit at least about 50% , preferably 
at least about 75%, more preferably at least about 80%-85%, preferably at least about 
90%, and most preferably at least about 95%-98%, or more, sequence identity over a 
10 defined length of the molecules. As used herein, substantially homologous also refers 
to sequences showing complete identity to the specified DNA or polypeptide 
sequence. 

In general, "identity" refers to an exact nucleotide-to-nucleotide or amino acid- 
to-amino acid correspondence of two polynucleotides or polypeptide sequences, 

15 respectively. Percent identity can be determined by a dkect comparison of the 

sequence information between two molecules by aligning the sequences, counting the 
exact number of matches between the two aligned sequences, dividing by the length of 
the shorter sequence, and multiplying the result by 100. Readily available computer 
programs can be used to aid in the analysis, such as ALIGN, Dayhoff, M.O. in Atlas of 

20 Protein Sequence and Structure M.O. Dayhoff ed., 5 Suppl. 1:353-358, National 

biomedicail Research Foundation, Washington, DC, which adapts the local homology 
algorithm of Smith and WatenamAdvances inAppl Math. 2:482-489, 1981 for 
peptide analysis. Programs for determining nucleotide sequence identity are available 
in the Wisconsin Sequence Analysis Package, Version 8 (available firom Genetics 

25 • Computer Group, Madison, WI) for example, the BESTFIT, FASTA and GAP 

programs, which also rely on the Smith and Waterman algorithm. These programs are 
readily utilized with the default parameters recommended by the manufacturer and 
described in the Wisconsin Sequence Analysis Package referred to above. For 
example, percent identity of a particular nucleotide sequence to a reference sequence 

30 can be determined using the homology algorithm of Smith and Waterman with a 
default scoring table and a gap penalty of six nucleotide positions. 
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Another method of estabUshing percent identity in the context of the present 
invention is to use the MPSRCH pack^e of programs copyrighted by the University 
of Edinburgh, developed by John F. Collins and Shane S. Sturrok, and distributed by 
IntelliGenetics, Tnc. (Mountain View, CA). From this suite of packages the Smith- 
5 Waterman algorithm can be employed where defeult paiametCTs are used for the 
scoring table (for example, gap open penalty of 12, gap extension penalty of one, and 
a gap of six). From the data generated the ♦•Match" value reflects "sequence identity." 
Other suitable programs for calculating the percent identity or similarity between 
sequences are generally known in the art, for example, another alignment program is 
1 0 BLAST, used with default parameters. For example, BLASTN and BLASTP can be 
used using the following default parameters: genetic code = standard; filter = none; 
strand = both; cutoff = 60; expect = 10; Matrix = BLOSUM62; Descriptions = 50 
sequences; sort by = HIGH SCORE; Databases = non-redundant, GenBank + EMBL + 
DDBJ + PDB + GenBank CDS translations + Swiss protein + Spupdate + PIR. 

15 Details of these programs can be found at the following internet address: 
http://www.ncbi.nlm.gov/cgi-bin/BLAST. . 

Alternatively, homology can be determined by hybridization of 
polynucleotides under conditions vMch form stable duplexes between homologous 
regions, followed by digestion with single-sti^inded-specific nuclease(s), and size 

20 determination of tiie digested fiagments. DNA sequences tiiat are substantially 
homologous can be identified in a Southem hybridization experiment under, for 
example, stiingent conditions, as defined for that particular system. Defining 
appropriate hybridization conditions is witiiin the skill of tiie art. See, e.g., Sambrook 
et al., supra; DNA Cloning, supra; Nucleic Acid Hybridization, supra. 

25 By "nucleic acid immunization" is meant tiie introduction of a nucleic acid 

molecule encoding one or more selected antigens into a host cell, for the in vivo 
expression of tiie antigen or antigens. The nucleic acid molecule can be introduced 
directiy into tiie recipient subject, such as by injection, inhalation, oral, inti^nasal and 
mucosal administi^ation, or tiie like, or can be intix»duced ex vivo, into cells which have 

30 been removed firom tiie host. In tiie latter case, tiie transformed cells are reintroduced 
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into the subject where an immune response can be mounted against the antigen 

encoded by the nucleic acid molecule. 

As used herein, 'ixeatmenf ' refers to any of (i) the prevention of infection or 

reinfection, as in a traditional vaccine, (ii) the reduction or elimination of symptoms, 
5 and (iii) the substantial or complete elimination of the pathogen m question. 

Treatment may be effected prophy lactically (prior to infection) or therapeutically 

(following infection). 

By *Srertebrate subject" is meant any member of the subphylum cordata, 

including, without limitation, humans and other primates, including non-human 
10 primates such as chimpanzees and other apes and monkey species; farm animals such 

as cattie, shbep, pigs, goats and horses; domestic mammals such as dogs and cats; 

laboratory animals includmg rodents such as mice, rats and guinea pigs; birds, 

including domestic, wild and game birds such as chickens, turkeys and other 

gallinaceous birds, ducks, geese, and the like. The term does not denote a particular 
15 age. Thus, both adult and newborn individuals are intended to be covered. The 

invention described herein is intended for use in any of the above vertehmte species, 

since the immune systems of all of these vertebrates operate similarly. 

n. Modes ofCarrving out the Invention 
20 Before describing the present invention in detail, it is to be understood that this 

invention is not limited to particular formulations or process parameters as such may, 
of course, vary. It is also to be understood that the terminology used herein is for the 
purpose of describing particular embodiments of the invention only, and is not 
intended to be limiting. 
25 Although a number of compositions and methods similar or equivalent to 

those described herein can be used in the practice of the present mvention, the 
preferred materials and methods are described herein. 

It is a discovery of the present invention that fusion protems, combinations of 
the individual components of these fusions, and polynucleotides encoding the same, 
comprising an NS3, an NS4, and an NS5a polypeptide with or witiiout a core 
polypeptide, or an NS3, an NS4, an NS5a, and an NS5b polypeptide, with or without a 
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core polypeptide, of an HCV viras can be used to activate HCV-specific T cells, i.e., T 
cells which recognize epitopes of these polypeptides. 

The present invention also pertains to compositions comprising HCV 
nonstructural fusion proteins and HCV E1E2 complexes, as well as compositions 
5 comprising polynucleotides encoding Ihe same or combinations of polypeptides and 
polynucleotides- 

The proteins, polynucleotides, compositions and combinations of the present 
invention can be used to stimulate a cellular immune response, such as to activate 
HCV-specific T cells, i.e., T cells which recognize epitopes of these polypeptides. 
10 Activation of HCV-specific T cells provides both in vitro and in vivo model systems 
for the development of HCV vaccines, particularly for identifying HCV polypeptide 
epitopes associated with a response. The compositions can also be used to generate an 
inmiune response against HCV in a mammal, particularly a CTL response for either 
therapeutic or prophylactic purposes. 

15 

Fusion Proteins 

The genomes of HCV strains contain a single open reading fiame of 
approximately 9,000 to 12,000 nucleotides, which is transcribed into a polyprotein. 
As shown in Figure 1 and the table below, an HCV polyprotein, upon cleavage, 

20 produces at least ten distinct products, in the order of NH2- Core-El -E2-p7-NS2-NS3- 
NS4a-NS4b-NS5a-NS5b-COOH. The core polypeptide occurs at positions 1-191, 
numbered relative to HCV-1 (see, Choo et al. (1991) Proc. Natl Acad Sci. USA 
M:2451-2455, for the HCV-1 genome). This polypeptide is further processed to 
produce an HCV polypeptide with approximately amino acids 1-173. The envelope 

25 polypeptides, El and E2, occur at about positions 1 92-383 and 384-746, respectively. 
The P7 domain is found at about positions 747-809. NS2 is an integral membrane 
protein with proteolytic activity and is found at about positions 810-1026 of the 
polyprotem. NS2, in combination with NS3, (found at about positions 1027-1657), 
cleaves the NS2-NS3 sissle bond vAnch in turn generates the NS3 N-terminus and 

30 releases a large polyprotein that includes both serine protease and RNA helicase 

activities. The NS3 protease, found at about positions 1027-1207, serves to process 
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the remaining polyprotein. The heUcase activity is found at about positions 1 193- 
1657. NS3 liberates an NS3 cofactor (NS4a, found about positions 1658-171 1), two 
proteins (NS4b found at about positions 1712-1972, and NS5a found at about 
positions 1973-2420), and an RNA-dependent KNA polymerase (NS5b found at about 
5 positions 2421-3011). Completion of polyprotein maturation is initiated by 

autocatafytic cleavage at the NS3-Ns4a junction, catalyzed by the NS3 serine protease. 



Domain 


Aoproximate Boundaries* 


C (core) 


1-191 


El 


192-383 


E2 


384-746 


P7 


747-809 


NS2 


810-1026 


NS3 


1027-1657 


NS4a 


1658-1711 


NS4b 


1712-1972 


NS5a 


1973-2420 


NSSb 


2421-3011 



♦Numbered relative to HCV-1. See, Choo et al. (1991) Proc Natl. Acad Sci. 
USA M:2451-2455. 



Fusion proteins for use in the compositions and methods, and polynucleotides 
encoding therefor, include or encode an NS3 polypeptide, anNS4 (NS4a and/or 

25 NS4b) polypeptide, an NS5a polypeptide and, optionally, an NS5b polypeptide. The 
fusion proteins may or may not include all or part of the core region. In certain 
embodiments, none of the core region is present in the compositions. The 
nonstructural regions need not be in the orda- in which they naturally occur in the 
native HCV polyprotein. Thus, for example, the NSSb polypeptide may be at the N- 

30 and/or C-terminus of the fusion or may be found internally. These polypeptides may 
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be derived from the same HCV isolate, or from different strains and isolates 
including isolates having any of the various HCV genotypes, to provide increased 
protection against a broad range of HCV genotypes. Additionally, polypeptides can 
be selected based on the particular viral clades endemic in specific geographic 
5 regions where vaccine compositions containing the fusions will be used. It is readily 
apparent that the subject fusions provide an eflFective means of treating HCV 
infection in a wide variety of contexts. 

In one embodiment, the fusion protein of the present invention includes an 
NS3 polypeptide that has been modified to inhibit protease activity, such that further 

1 0 cleavage of the fusion is uihibited. Hie NS3 polypeptide can be modified by deletion 
of all or a portion of the NS3 protease domain. Alternatively, proteolytic activity can 
be inhibited by substitutions of amino acids within active regions of the protease 
domain. Finally, additions of amino adds to active regions of the domain, such that 
the catalytic site is modified, will also serve to mhibit proteolytic activity. 

15 As explained above, the protease activity is found at about amino acid 

positions 1027-1207, numbered relative to the full-length HCV-1 polyprotein (see, 
Choo et al., Proc, Natl Acad, Set USA (1991) 88:2451-2455), positions 2-182 of 
Figure 3. The structure of the NS3 protease and active site are known. See, e.g., De 
Francesco et al,,Araivir, Ther. (1998) 3:99-109; Koch et al.. Biochemistry (2001) 

20 40:63 1-640. Thus, deletions or modifications to the native sequence will typically 
occur at or near the active site of the molecule. Particularly, it is desirable to modify 
or make deletions to one or more amino acids occurring at positions 1- or 2-1 82, 
preferably 1- or 2-170, or 1- or 2-155 of Figure 3. Preferred modifications are to the 
catalytic triad at the active site of the protease, i.e., H, D or S residues, in order to 

25 inactivate the protease. These residues occur at positions 1083, 1 105 and 1 165, 

respectively, numbered relative to the full-length HCV polyprotein (positions 58, 80 
and 140, respectively, of Figure 3). Such modifications will suppress proteolytic 
cleavage while maintaining T-cell epitopes. 

One of skill in the art can readily determine portions of the NS3 protease to delete in 
30 order to disrupt activity. The presence or absence of activity can be determined using 
methods known to those of skill in tiie art 
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For example, protease activity or lack thereof may be determined using assays 
well known in the art. See, e.g., Takeshita et al., AnaL Biochem. (1997) 247:242- 
246; Kakiuchi et al., J. Biochent (1997) 122:749-755; Sali et al., Biochemistry 
(1998) 22:3392-3401; Cho et al., /. Virol, Meth (1998) ^:109-1 15; Cerretani et al., 
5 Anal Biochem. (1999) 266:192-197; Zhang etal.,^«a/. Biochem, (1999) 270:268- 
275; Kakiuchi et al., J. Virol Meth (1999) 8Q:77-84; Fowler et al., J. Biomol 
Screen. (2000) 5:153-158; and Kim et al.,i4«a/. Biochem. (2000) 2M:42-48. 

The NS3, NS4, NS5a, and NS5b polypeptides present in the various fosions 
described above can either be fiill-length polypeptides or portions of NS3, NS4 

10 (NS4a and/or NS4b), NS5a, andNSSb polypeptides. The portions of NS3, NS4, 
NS5a, and NS5b polypeptides making up the fusion protein preferably comprise at 
least one epitope, which is recognized by a T cell receptor on an activated T cell, 
such as 2152-HEYPVGSQL-2160 (SEQ ID NO:l) and/or 2224- 
AELIEANLLWRQEMG-2238 (SEQ ID N0:2). Epitopes of NS3, NS4 (NS4a and 

15 NS4b), NS5a, NS5b, NS3NS4NS5a, and NS3NS4NS5aNS5b can be identified by 
several methods. For example, NS3, NS4, NS5a, NS5b polypeptides or fusion 
proteins comprising any combination of the above, can be isolated, for example, by 
immunoaffinity purification using a monoclonal antibody for the polypq)tide or 
protein. The isolated protein sequence can then be screened by preparing a series of 

20 short peptides by proteolytic cleavage of the purified protein, which together span the 
entire protein sequence. By starting with, for example, 100-mer polypeptides, each 
polypeptide can be tested for the presence of epitopes recognized by a T-cell receptor 
on an HCV-activated T cell, progressively smaller and overlapping fragments can 
then be tested from an identified 100-mer to map the epitope of interest. 

25 Epitopes recognized by a T-cell receptor on an HCV-activated T cell can be 

identified by, for example, ^^Cr release assay or by lymphoproliferation assay (see the 
examples). In a ^^Cr release assay, target cells can be constructed that display the 
epitope of interest by cloning a polynucleotide encoding the q)itope into an 
expression vector and transforming the expression vector into the target cells. HCV- 

30 specific CD8^ T cells will lyse target cells displaying, for example, an NS3, NS4, 
NS5a, NS5b, NS3NS4NS5a, or NS3NS4NS5aNS5b epitope and will not lyse cells 
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that do not display such aa epitope. In a lymphoproliferation assay, HCV-activated 
CD4^ T cells will proliferate when cultured with, for example, an NS3, NS4, NS5a, 
NS5b, NS3NS4NS5a, or NS3NS4NS5aNS5b epitopic peptide, but not in the absence 
of an HC V epitopic peptide. 
5 NS3, NS4, NSSa, and NS5b polypeptides can occur in any order in the fusion 

protein. If desired, at least 2, 3, 4, 5, 6, 7, 8, 9, or 10 or more of one or more of the 
polypq)tides may occur in the fusion protein. Multiple viral strains of HCV occur, 
and NS3, NS4, NSS^, and NS5b polypeptides of any of these strains can be used in a 
fusion protein. A representative fusion protem for use in the present invention is 
10 shown if Figures 5A-5J. The depicted sequence includes amino acids 1242-301 1 of 
the HCV polyprotein (representing polypeptides j&om NS3, NS4, NSSa and NS5b) 
with amino acids 1-121 of the HCV polyprotein (representing a polypeptide from the 
core region) fused to the C-terminus of NSSb. This numbering is relative to the 
HCV-1 polyprotein. 

15 Nucleic acid and amino acid sequences of a number of HCV strains and 

isolates, including nucleic acid and amino acid sequences of NS3, NS4, NSSa, NSSb 
genes and polypeptides have been deteraiined. For example, isolate HCV Jl . 1 is 
described in Kubo et al ,(1989) Japan. NucL Acids Res. 17: 10367-10372; Takeuchi 
et a/.(1990) Gene 91 :287.291; Takeuchi et al (1990) J. Gen. Virol. 71 :3027-3033; 

20 and Takeuchi et al (1990) Nucl. Acids Res. 18:4626. The complete coding 

sequences of two independent isolates, HCV-J and BK, are described by Kato et al, 
: (1990) Proc. Natl. Acad. Sci. USA 87;9S24-9528 and Takamizawa et al, (1991) J. 
Virol. 65: 1 105-1 1 13 respectively. 

Publications that describe HCV-1 isolates include Choo et al (1990) Brit. 

25 Med. Bull. 46:423-441; Choo et al (1991) Proc. Natl. Acad. Sci. USA 88:2451-2455 
and Han et al (1991) Proc. Natl. Acad. Sci. USA 88:1711-1715. HCV isolates 
HC-Jl and HC-J4 are described in Okamoto et al (1991) Japan J. Exp. Med. 
60:167-177. HCV isolates HCT 18-, HCT 23, Th, HCT 27, ECl and EClO are 
described in Weiner et al (1991) Virol. 180:842-848. HCV isolates Pt-l, HCV-Kl 

30 and HCV-K2 are described in Enomoto et al (1990) Biochem. Biophys. Res. 
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Commun. 170:1021-1025. HCV isolates A, C, D & E are described in 
Tsukiyama-Kohara et al (1991) Vims Genes 5:243-254. 

Each of the NS3, NS4, NS5a, and NS5b components of a fusion protein can 
be obtained from the sgme HCV strain or isolate or from different HCV strains or 
5 isolates. Fusion proteins comprising HCV polypeptides from, for example, the NS3 
polypeptide can be derived from a first strain of HCV, and the NS4, and NS5a 
polypeptides can be derived from a second strain of HCV. Alternatively, the NS4 
polypeptide can be derived from a first strain of HCV, and the NS3 and NS5a 
polypeptides can be derived fix)m a second strain of HCV. Optionally, the NS5a 

10 polypeptide can be derived from a first strain of HCV, and the NS3 and NS4 

polypeptides can be derived from a second strain of HCV. NS3, NS4 and NS5a 
polypeptides that are each derived from different HCV strains can also be used in an 
HCV fusion protein. Similarly, in a fusion protein comprising NS5b, at least one of 
the NS3, NS4, NS5a, and NS5b polypeptides can be derived from a different HCV 

15 strain than the other polypeptides. Optionally, NS3, NS4, NS5a, and NS5b 

polypeptides that are' each derived from different HCV strains can also be used in an 
NS3NS4NS5aNS5b fijsion protein. 

In addition to NS3, NS4a, NS4b, NS5a and NS5b, the fusion proteins can 
contain other polypeptides daived from the HCV polyproteiiL For example, it may 

20 be desirable to include polypeptides derived from the core region of the HCV 
polyprotein. This region occurs at amino acid positions 1-191 of the HCV 
polyprotein, numbered relative to HCV-1 . Either the full-length protein, fragments 
thereof, such as amino acids 1-150, e.g., amino acids 1-130, 1-120, for example, 
amino acids 1-121, 1-122, 1-123, etc., or smaller fragments containing epitopes of 

25 the full-length protein may be used in the subject fusions, such as those epitopes 
found between smiiio Ecids 10-53, smino scids 10-45, amino scids 67-88, amino 
acids 120-130, or any of the core q)itopes identified in, e.g., Houghton et al., U.S. 
Patent No. 5,350,671; Chien et al., Proc Natl. Acad. Sci. USA (1992) ^:10011- 
10015; Chien et al., J. Gastroent. Hepatol. (1993) 8:S33-39; Chien et al., 

30 International Publication No. WO 93/00365 ; Chien, D. Y., International Publication 
No. WO 94/01778; and U.S. Patent Nos. 6,280,927 and 6,150,087. Moreover, a 
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protein resxxlting from a frameshift in the core region of the polyprotein, such as 
described in International Publication No. WO 99/63941, may be used. The fusions 
may also contain polynucleotides encoding EIE2 polypeptides, as described further 
below. 

5 Preferably, the above-described fusion proteins, as well as the individual 

components of these proteins, are produced recombinantly. A polynucleotide 
encoding these proteins can be introduced into an expression vector which can be 
expressed in a suitable expression system. A variety of bacterial, yeast, manomalian 
and insect expression systems are available in the art and any such expression system 

10 can be used. Optionally, a polynucleotide encoding these proteins can be translated 
in a cell-free translation system. Such methods are well known in the art. The 
proteins also can be constructed by solid phase protein synthesis. 

If desired, the fusion proteins, or the individual components of these proteins, 
also can contain other non-HCV amino acid sequences, such as amino acid linkers or 

1 5 signal sequences, as well as Ugands usefid in protein purification, such as 
glutathione-S-transferase and staphylococcal protein A 

E1E2 Polypeptides 

As explained above, the compositions of the present invention may also 

20 include El and E2 polypeptides, complexes of these polypeptides or polynucleotides 
encoding the same. The El and E2 polypeptides and complexes thereof can be 
provided independent of the nonstructural fusion protein or can be incorporated into 
the same fusion. Moreover, E1E2 complexes can be provided as proteins, or as 
polynucleotides encoding the same. 

25 In this regard. El , E2 and p7 are known to contain human T-cell epitopes 

(both CD4+ and CD8+) and including one or more of these epitopes serves to 
increase vaccine efficacy as well as to increase protective levels against multiple 
HCV genotypes. Moreover, multiple copies of specific, conserved T-cell epitopes 
can also be used in E1E2 complexes, such as a composite of epitopes from different 

30 genotypes. 
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As explained above, the El and E2 polyp^tides that make up the E1E2 
complexes can be associated either through non-covalent or covalent interactions. 
Such complexes may be made up of immunogenic fragments of El and E2 which 
comprise epitopes. For example, fragments of El polypeptides can comprise from 
5 about 5 to nearly the full-lengfli of the molecule, such as 6, 10, 25, 50, 75, 100, 125, 
150, 175, 185 or more amino acids of an El polypeptide, or any integer between the 
stated numbers. Similarly, fragments of E2 polypeptides can comprise 6, 10, 25, 50, 
75, 100, 150, 200, 250, 300, or 350 amino acids of an E2 polypeptide, or any integer 
between the stated numbers. The El and E2 polypeptides may be from the same or 

10 different HCV strains. For example, epitopes derived from, e.g., the hypervariable 
region of E2, such as a region spanning amino acids 384-410 of 390-410, can be 
included in the E2 polypeptide. A particularly effective E2 epitope to incorporate 
into the E2 sequence or E1E2 complexes is one which includes a consensus sequence 
derived from this region, such as the consensus sequence Gly-Ser-Ala-Ala-Arg-Thr- 

1 5 Thr-Ser-Gly-Phe-Val-Ser-Leu-Phe-Ala-Pro-Gly-Ala-Lys-Ghi.Asn (SEQ ID N0:5), 
which represents a consensus sequence for amino acids 390-410 of the HCV type 1 
genome. Additional epitopes of El and E2 are known and described in, e.g., Chien 
et al., Intemational Publication No. WO 93/00365. 

Moreover, the El and E2 polypeptides may lack all or a portion of the 

20 membrane sparming domain. The membrane anchor sequence functions to associate 
the polypeptide to the endoplasmic reticulum. Normally, such polypeptides are 
capable of secretion into growth medium in which an organism expressing the 
protein is cultured. However, as described in Intemational Publication No. WO 
98/50556, such polypeptides may also be recovered intracellularly. Secretion into 

25 growth medium is readily determined using a number of detection techniques, 

including, e.g., polyacrylamide gel electrophoresis and the like, and immunological 
techniques such as immunoprecipitation assays as described in, e.g., Intemational 
Publication No. WO 96/04301, published February 15, 1996. With El, generally 
polypeptides terminating with about amino acid position 370 and higher (based on 

30 the numbering of HCVl El) will be retained by the ER and hence not secreted into 
growth media. With E2, polypeptides terminating with about amino acid position 
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731 and higher (also based on the numbering of the HCVl E2 sequence) will be 
retained by the ER and not secreted. (See, e.g.. International Publication No. WO 
96/04301, published February 15, 1996). It should be noted that these amino acid 
positions are not absolute and may vary to some degree. Thus, the present invention 
contemplates the use of El and E2 polypeptides which retain the transmembrane 
bmding domain, as well as polypeptides which lack all or a portion of the 
transmembrane binding domain, including El polypeptides terminating at about 
amino acids 369 and lower, and £2 polypeptides, terminating at about amino acids 
730 and lower, are intended to be captured by the present invention. Furthermore, 
the C-terminal truncation can extend beyond the transmembrane spanning domain 
towards the N-terminus. Thus, for example, El truncations occurring at positions 
lower than, e.g., 360 and E2 truncations occurring at positions lower than, e.g., 715, 
are also encompassed by the present invention. All that is necessary is that the 
truncated El and E2 polypeptides remain functional for their intended purpose. 
However, particularly preferred truncated El constructs are those that do not extend 
beyond about amino acid 300. Most preferred are those terminating at position 360. 
Preferred truncated E2 constructs are those with C-terminal truncations that do not 
extend beyond about amino acid position 715. Particularly preferred E2 tnmcations 
are those molecules truncated after any of amino acids 715-730, such as 725. If 
truncated molecules are used, it is preferable to use El and E2 molecules that are 
both truncated. 

E2 exists as multiple species (Spaete et al., Virol. (1992) 188:819-830; Selby 
etal., J ViroL (1996) 70:5177-5182; Grakouietal., 7. ViroL (1993)67:1385-1395; 
Tomei et al., J, ViroL (1993) 67:4017-4026) and clipping and proteolysis may occur 
at the N- and C-termini of the El and E2 polypeptides. Thus, an E2 polypeptide for 
use herein may comprise at least amino acids 405-661, e.g., 400, 401, 402... to 661, 
such as 384-661, 384-715, 384-746, 384-749 or 384-809, or 384 to any C-terminus 
between 661-809, of an HCV polyprotein, numbered relative to the full-length HCV- 
1 polyprotein. Similarly, preferable El polypeptides for use herein can comprise 
amino acids 192-326, 192-330, 192-333, 192-360, 192-363, 192-383, or 192 to any 
C-terminus between 326-383, of an HCV polyprotein. 
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The El and E2 polypeptides and complexes thereof may also be present as 
asialoglycoproteins. Such asialoglycoproteins are produced by methods known in the 
art, such as by using cells in v^ch terminal glycosylation is blocked. When these 
proteins are expressed in such cells and isolated by GNA lectin affinity 
5 chromatography, the El and E2 proteins aggregate spontaneously. Detailed methods 
for producing these E1E2 aggregates are described in, e.g., U.S. Patent No. 
6,074,852. For example, E1E2 complexes are readily produced recombinantly, either 
as fusion proteins or by e.g., co-transfectmg host cells with constructs encoding for 
the El and E2 polypeptides of interest. Co-transfection can be accomplished either 

10 in trans or cw, i.e., by using separate vectors or by using a single vector ^ch bears 
both of the El and E2 genes. If done using a single vector, both genes can be driven 
by a single set of control elements or, alternatively, the genes can be present on the 
vector in individual expression cassettes, driven by individual control elements. 
Following expression, the El and E2 proteins will spontaneously associate. 

1 5 Alternatively, the complexes can be formed by mixing the individual proteins 

together which have been produced separately, either in purified or semi-purified 
form, or even by mixing culture media in which host cells expressing the proteins, 
have been cultured, if the proteins are secreted. Finally, the E1E2 complexes of the 
present invention may be expressed as a fusion protein wherein the desired portion of 

20 El is fused to the desired portion of E2. 

Moreover, the E1E2 complexes may be present as a heterogeneous mixture of 
molecules, due to clipping and proteolytic cleavage, as described above. Thus, a 
composition including E1E2 complexes may include multiple species of E1E2, such 
as E1E2 terminating at amino acid 746 (EIE2746), E1E2 terminating at amino acid 

25 809 (E1E2809)5 or any of the other various El and E2 molecules described above, 

such as E2 molecules with N-terminal truncations of from 1-20 amino acids, such as 
E2 species beginning at amino acid 387, amino acid 402, amino acid 403, etc. 

E1E2 complexes are readily produced recombinantly, either as fusion proteins 
or by e.g., co-transfecting host cells with constructs encoding for the El and E2 

30 polypeptides of interest. Co-transfection can be accomplished either in trans or cis, 
i.e., by using separate vectors or by using a single vector which bears both of the El 
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and E2 genes. If done using a single vector, both genes can be driven by a single set 
of control elements or, alternatively, the genes can be present on the vector in 
individual expression cassettes, driven by individual control elements. Follo>ving 
expression, die El and E2 proteins will spontaneously associate. Alternatively, the 

5 complexes can be formed by mixing the individual proteins togetheir which have been 
produced separately, either in purified or semi-purified form, or even by mixing 
culture media in which host cells explressing the proteins, have been cultured, if the 
proteins are secreted. Finally, the E1E2 complexes of the present invention noiay be 
expressed as a fiision protem wherein the desired portion of El is fused to the desired 

10 portion of E2. 

Methods for producing E1E2 complexes from fixU-length, truncated El and 
E2 proteins which are secreted into media, as well as intracellxilarly produced 
truncated proteins, are known in the art. For example, such complexes may be 
produced recombinantly, as described in U.S. Patent No. 6,121,020; Ralston et al., J. 

15 Virol (1993) 67:6753-6761, Grakoui et al., J, Virol (1993) 62:1385-1395; and 
Lanford et al.. Virology (1993) 122:225-235. 

Polynucleotides Encoding the Fusion Proteins and E1E2 Complexes 

Polynucleotides contain less than an entire HCV genome and can be RNA or 

20 single- or double-stranded DNA. Preferably, the polynucleotides are isolated free of 
other components, such as proteins and lipids. The polynucleotides encode the 
frision proteins, El and E2 polypeptides and complexes thereof, described above, and 
thus comprise coding sequences thereof Polynucleotides of the invention can also 
comprise other non-HCV nucleotide sequences, such as sequences coding for linkers, 

25 signal sequences, or ligands useful in protein purification such as glutathione-S- 
transferase and staphylococcal protein A. 

Polynucleotides encoding the various HCV polypeptides can be isolated from 
a genomic library derived from nucleic acid sequences present in, for ©cample, the 
plasma, serum, or liver homogenate of an HCV infected individual or can be 

30 synthesized in the laboratory, for example, using an automatic synthesizer. An 
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amplification method such as PGR can be used to amplify polynucleotides firom 
either HCV genomic DNA or cDNA encoding therefor. 

Polynucleotides can comprise coding sequences for these polypeptides which 
occur naturally or can include artificial sequaices which do not occur in nature. 
5 These polynucleotides can be ligated to form a coding sequence for the fusion 
proteins and E1E2 complexes using standard molecular biology teclmiques. If 
desired, polynucleotides can be cloned into an expression vector and transformed 
into, for example, bacterial, yeast, insect, or mammalian cells so that the fusion 
proteins of the invention can be expressed in and isolated from a cell culture. 

10 The expression constructs of the present invention, including the desired 

fusion, or individual expression constructs comprising the individual components of 
these fusions, may be used for nucleic acid immunization, to stimulate an 
immimological response, such as a cellular immune response, using standard gene 
delivery protocols. Methods for gene delivery are known in the art. See, e.g.,U.S. 

15 Patent Nos. 5,399,346, 5,580,859, 5,589,466. Genes can be delivered eiflier directiy 
to the vertebrate subject or, alternatively, delivered ex vivo, to cells derived from the 
subject and the cells reimplanted in the subject. For example, the constructs can be 
delivered as plasmid DNA, e.g., contained within a plasmid, such as pBR322, pUC, 
or ColEl 

20 Additionally, the expression constructs can be packaged in liposomes prior to 

delivery to the cells. Lipid encapsulation is generally accomplished using liposomes 
which are able to stably bind or entrap and retain nucleic acid. The ratio of 
condensed DNA to lipid preparation can vary but will generally be around 1 : 1 (mg 
DNA:micromoles lipid), or more of lipid. For a review of the use of liposomes as 

25 carriers for delivery of nucleic acids, see. Hug and Sleight, Biochim. Biophys. Acta. 
(1991) 1097 :1-17: Straubinger et al., mMethods ofEnzymology (1983), Vol. 101, pp. 
512-527. 

Liposomal preparations for use with the present invention include cationic 
(positively charged), anionic (negatively charged) and neutral preparations, with 
30 cationic liposomes particularly preferred. Cationic liposomes are readily available. 
For example, N[l-2,3-dioleyloxy)propyl]-N,N,N-triethylammonium (DOTMA) 
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liposomes are available under the trademark Lipofectin, from GIBCO BRL, Grand 
Island, NY. (See, also. Feigner et al., ProcNatl Acad, Sci. USA (1987) 84:7413- 
7416). Other commercially available lipids include transfectace (DDAB/DOPE) and 
DOTAP/DOPE (Boerhinger). Other cationic liposomes can be prepared from readily 
5 available materials using techniques well known in the art See, e.g., Szoka et aL, 
Proc. Natl. Acad. Sci. USA (1978) 75:4194^198; PCT PubUcation No. WO 
90/11092 for a description of the synthesis of DOTAP (l,2-bis(oleoyloxy)-3- 
(trimefhylammonio)propane) liposomes. The various liposome-nucleic acid 
complexes are prepared using methods known in the art. See, e.g., Straubinger et al., 

10 in METHODS OF IMMUNOLOGY (1983), Vol. 101, pp. 512-527; Szoka et al., 
Proc, Natl. Acad. Sci. USA (1978) 75:4194-4198; Papahadjopoulos et al., Biochim. 
Biophys. Acta (1975) 394:483; Wilson et al., Cell (1979) 17:77); Deamer and 
Bangham, Biochim. Biophys. Acta (1976) 443:629: Ostro et al., Biochem. Biophys. 
Res. Commun. (1977) 26:836; Fraley et al., Proc. Natl. Acad. Sci. USA (1979) 

15 26:3348); Enoch and Strittmatter, Proc. Natl Acad. Sci. USA (1979) 26:145); Fraley 
et aL, /. Biol. Chem. (1980) 255:10431; Szoka and Papahadjopoulos, Proc. Natl 
Acad. Sci. USA (1978) 25:145; and Schaefer-Ridder et al., Science (1982) 211:166. 

The DNA can also be delivered in cochleate lipid compositions similar to 
those described by Papahadjopoulos et al., Biochem. Biophys. Acta. (1975) 394:483- 

20 491. See, also, U.S. Patent Nos. 4,663,161 and 4,871,488. 

A number of viral based systems have "been developed for gene transfer into 
maiomaUan cells. For example, retroviruses provide a convenient platform for gene 
delivery systems, such as murine sarcoma virus, mouse mammary tumor virus, 
Moloney murine leukemia virus, and Rous sarcoma virus. A selected gene can be 

25 inserted into a vector and packaged in retroviral particles using techniques known in 
the art The recombinant virus can then be isolated and delivered to cells of the 
subject either in vivo or ex vivo. A number of retroviral systems have been described 
(U.S. Patent No. 5,219,740; Miller and Rosman, BioTechniques (1989) 7:980-990; 
Miller, A.D., Human Gene Therapy (1990) i:5-14; Scarpa et al., Virology (1991) 

30 180:849-852; Bums et al., Proc. Natl Acad Sci. USA (1993) 20:8033-8037; and 
Boris-Lawrie and Temin, Cur. Opin. Genet. Develop. (1993) 3:102-109. Briefly, 
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retroviral gene delivery vehicles of the present invention may be readily constructed 
from a wide variety of retroviruses, including for example, B, C, and D type 
retroviruses as well as spumaviruses and lentiviruses such as FIV, HIV, HIV-1, HIV- 
2 and SIV (see RNA Tumor Viruses, Second Edition, Cold Spring Harbor 
5 Laboratory, 1985). Such retroviruses may be readily obtained from depositories or 
collections such as the American Type Culture Collection ("ATCC"; 10801 
University Blvd., Manassas, VA 201 10-2209), or isolated from known sources using 
commonly available techniques. 

A number of adenovirus vectors have also been described, such as adenovirus 

10 Type 2 and Type 5 vectors. Unlike retroviruses which integrate into the host 
genome, adenoviruses persist extrachromosomally thus minimizing the risks 
associated with insertional mutagenesis (Haj-Ahmad and Graham, J, Virol (1986) 
52:267-274; Bett et al., J, Virol (1993) 67:591 1-5921; Mittereder et al.. Human Gene 
Therapy (1994) 5:717-729; Seth et al., J, Virol (1994) 68:933-940; Ban et al.. Gene 

15 Tlierapy (1994) 1:51-58; Berkner, K.L. BioTechniques (1988) 6:616-629; and Rich et 
al.. Human Gene Therapy (1993) 4:461-476). 

Molecular conjugate vectors, such as ttie adenovirus chimeric vectors 
described in Michael et al., J. Biol Chem. (1993) 268:6866-6869 and Wagner et al., 
Proc. Natl Acad, ScL USA (1992) 89:6099-6103, can also be used for gene deUvery. 

20 Members of the Alphavirus genus, such as but not Umited to vectors derived 

from tiie Sindbis and Semliki Forest viruses, VEE, will also find use as viral vectors 
for delivering the gene of interest. For a description of Sindbis-virus derived vectors 
useful for the practice of the instant methods, see, Dubensky et al., J. Virol (1996) 
70:508-519; and Intemational PubUcationNos. WO 95/07995.and WO 96/17072. 

25 Other vectors can be used, including but not limited to simian virus 40 and 

cytomegalovirus. Bacterial vectors, such as Salmonella ssp. Yersinia enterocolitica. 
Shigella spp.. Vibrio cholerae, Mycobacterium strain BCG, and Listeria 
monocytogenes can be used. Minichromosomes such as MC and MCI, 
bacteriophages, cosmids (plasmids into which phage lambda cos sites have been 

30 inserted) and replicons (genetic elements that are capable of replication under their 
own control in a cell) can also be used. 
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The expression constructs may also be encq)sulated, adsorbed to, or 
associated with, particulate carriers. Such carriers present multiple copies of a 
selected molecule to the immune system and promote trapping and retention of 
molecules in local lymph nodes. Hie particles can be phagocytosed by macrophages 
5 and can enhance antigen presentation through cytokine release. Examples of 

particulate carriers include those derived from polymethyl methacrylate polymers, as 
well as microparticles derived from poly(lactides) and poly(lactide-co-glycolides), 
known as PLG. See, e.g., Jefifery et al., Pharm. Res, (1993) 10:362-368; and McGee 
et al., J. Microencap. (1996). 

1 0 One preferred method for adsorbing macromolecules onto prepared 

microparticles is described in International Publication No. WO 00/050006. Briefly, 
microparticles are rehydrated and dispersed to an essentially monomeric suspension 
of microparticles using dialyzable anionic or cationic detergents. Useful detergents 
include, but are not limited to, any of the various N-methylglucamides (known as 

1 5 MEGAs), such as heptanoyl-N-methylglucaimde (MEGA-7), octanoyl-N- 
methylglucamide (MEGA-8), nonanoyl-N-methylglucamide (MEGA-9), and 
decanoyl-N-mefliyl-glucamide (MEGA-10); cholic acid; sodium cholate; 
deoxycholic acid; sodiuni deoxycholate; taurocholic acid; sodium taurocholate; 
taurodeoxycholic acid; sodium taurodeoxycholate; 3-[(3- 

20 cholamidopropyl)dimethylammomo] -1-propane-sulfonate (CHAPS); 3-[(3- 

cholamidopropyl) dmiethylammomo]-2-hydroxy-l-propane-sulfonate (CHAPSO); 
-dodecyl-N,N-dimethyl-3-ammonio-l-propane-sulfonate (ZWITTERGENT 3-12); 
N,N-bis-(3-D-gluconeamidopropyl)-deoxycholandde(DEOXY-BIGCHAP); 
-octylglucoside; sucrose monolaurate; glycocholic acid/sodium glycocholate; 

25 laurosarcosine (sodium salt); glycodeoxycholic acid/sodium glycodeoxycholate; 
sodium dodceyl sulfate (SDS); 3-(trimethylsilyl)-l-propanesulfonic acid (DSS); 
cetrimide (CTAB, the principal component of which is 

hexadecyltrimethylammonium bromide); hexadecyltrimethylammonium bromide; 
dodecyltrimethylammonium bromide; hexadecyltrimethyl-ammonium bromide; 
30 tetradecyltrimethylammonixmi bromide; benzyl dimethyldodecylammonium 

bromide; benzyl dimethyl-hexadecylammonium chloride; and ben2yl dimethyltetra- 
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decylainmonixxm bromide. The above detergents are commercially available from 
e.g., Sigma Chemical Co., St. Loxiis, MO. Various cationic lipids known in the art 
can also be used as detergents. See Balasubramaniam at al., 1996, Gene Ther,, 
3:163-72 and Gao, X., and L, Huang. 1995, Gene Ther., 2:7110-722. 
5 A wide variety of other methods can be used to deliver the expression 

constructs to cells. Such methods include DEAE dextran-mediated transfection, 
calcium phosphate precipitation, polylysine- or polyomithine-mediated transfection, 
or precipitation using other insoluble inorganic salts, such as strontium phosphate, 
aluminum silicates including bentonite and kaolin, chromic oxide, magnesium 

10 silicate, talc, and the like. Other useful methods of transfection include 

electroporation, sonoporation, protoplast fusion, liposomes, peptoid deUvery, or 
microinjection. See, e.g.; Sambrook et al., supra, for a discussion of techniques for 
transforming cells of interest; and Feigner, P.L., Advanced Drug Delivery Reviews 
(1990) 5:163-187, for a review of delivery systems useful for gene transfer. Methods 

15 of delivering DNA using electroporation are described in, e.g., U.S. Patent Nos. 
6,132,419; 6,451,002, 6,418,341, 6233,483, U.S. Patent Publication No. 
2002/0146831; and International Publication No. WO/0045823. 

Moreover, the HCV polynucleotides can be adsorbed to, or entrapped within, 
an ISCOM. Classic ISCOMs are formed by combination of cholesterol, saponin, 

20 phospholipid, and immunogeos, such as viral envelope proteins. Generally, the ,HC V 
molecules (usually with a hydrophobic region) are solubilized in detergent and added 
to the reaction mixture, whereby ISCOMs are formed with the HCV molecule 
incorporated therein. ISCOM matrix compositions are formed identically, but 
without viral proteins. Proteins with high positive charge may be electrostatically 

25 bound in the ISCOM particles, rather than through hydrophobic forces. For a more 
detailed general discussion of saponins and ISCOMs, and methods of formvdating 
ISCOMs, see Barr et al. (1998) Adv. Drug Delivery Reviews 32:247-271 (1998); U.S. 
Patent Nos. 4,981,684, 5,178,860, 5,679,354 and 6,027,732; European Publ. Nos. 
EPA 109,942; 180,564 and 231,039; and Coulter et al. (1998) Vaccine 16:1243. 

30 Additionally, biolistic delivery systems employing particulate carriers such as 

gold and tungsten, are especially useful for delivering the expression constructs of 
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the present invention. The particles are coated with the construct to be delivered and 
accelerated to high velocity, generally under a reduced atmosphere, using a gun 
powder discharge from a "gene gun." For a description of such techniques, and 
£5)paratuses useful therefore, see, e,g., U.S. Patent Nos. 4,945,050; 5,036,006; 
5 5,100,792; 5,179,022; 5,371,015; and 5,478,744. 

Compositiojis Comprising Fusion Proteins or Polynucleotides 

The invention also provides conapositions comprising the fusion proteins or 
polynucleotides, as well as compositions including the individual components of 

10 these fusion proteins or polynucleotides. Compositions of the invention preferably 
comprise a pharmaceutically acceptable carrier. The carrier should not itself induce 
the production of antibodies harmfial to the host. Pharmaceutically acceptable 
carriers are well known to those in the art. Such carriers include, but are not limited 
to, large, slowly metabolized, macromolecules, such as proteins, polysaccharides 

15 such as latex functionaUzed sepharose, agarose, cellulose, cellulose beads and the 
like, polylactic acids, polyglycolic acids, polymeric amino acids such as 
polyglutamic acid, polylysine, and the like, amino acid copolymers, and inactive 
virus particles. 

Pharmaceutically acceptable salts can also be used in compositions of the 
20 invention, for example, mineral salts such as hydrochlorides, hydrobromides, 
phosphates, or sulfates, as well as salts of organic acids such as acetates, 
proprionates, malonates, or benzoates. Especially useful protein substrates are serum 
albumins, keyhole limpet hemocyanin, inmiunoglobulin molecules, fhyroglobulin, 
ovalbumin, tetanus toxoid, and other proteins well known to those of skill in the art. 
25 Compositions of the invention can also contain liquids or excipients, such as water, 
saline, glycerol, dextrose, ethanol, or the like, singly or in combination, as well as 
substances such as wetting agents, emulsifying agents, or pH buffering agents. 
Liposomes can also be used as a carrier for a composition of the invention, such 
liposomes are described above. 
30 If desired, co-stimulatory molecules which improve immunogen presentation 

to lymphocytes, such as B7-1 or B7-2, or cytokines such as GM-CSF, IL-2, and BL- 
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12, can be included in a composition of the invention. Optionally, adjuvants can also 
be included in a conq)osition. Adjuvants which can be used include, but are not 
limited to: (1) aluminum salts (alum), such as aluminum hydroxide, aluminum 
phosphate, aluminum sulfate, etc.; (2) oil-in-water emulsion formulations (with or 
S without other specific immunostimulating agents such as muramyl peptides (see 

below) or bacterial cell wall components), such as for example (a) MF59 (U.S. Patent 
No. 6,299,884; Chapter 10 in Vaccine design: the submit and adjuvant approach, 
eds. Powell & Newman, Plenum Press 1995), containing 5% Squalene, 0.5% 
TWEEN 80™, and 0.5% SPAN 85™ (optionally containing various amounts of 

10 MTP-PE (see below), although not required) formulated into submicron particles 
using a microfluidizer such as Model 1 lOY microfluidizer (Microfluidics, Newton, 
MA), (b) SAP, containing 10% Squalane, 0.4% TWEEN 80™, 5% pluronic-blocked 
polymer L121, and thr-MDP either microfluidized into a submicron emulsion or 
vortexed to generate a larger particle size emulsion, and (c) RIBI™ adjuvant system 

15 (RAS), (Elibi Immunochem, Hamilton, MT) containing 2% Squalene, 0.2% TWEEN 
80™, and onp or more bacterial cell wall components from the group consisting of 
monophosphorylipid A (MPL), trehalose dimycolate (TDM), and cell wall skeleton 
(CWS), preferably MPL + CWS (DETOX™); (3) saponin adjuvants, such as QS21 
or STIMULON™ (Cambridge Bioscience, Worcester, MA) may be used or particles 

20 generated therefrom such as ISCOMs (immunostimulating complexes), which 

ISCOMs may be devoid of additional detergent, see, e.g., International Publication 
No. WO 00/07621; (4) Complete Freund*s Adjuvant (CPA) and Incomplete Freund's 
Adjuvant (IFA); (5) cytokines, such as interleukins (IL-1, IL-2, IL-4, IL-5, IL-6, IL- 
7, IL-12 (Intemational Publication No. WO 99/44636), etc.), interferons (e.g., 

25 gamma interferon), macrophage colony stimulating fiictor (M-CSF), tumor necrosis 
factor (TNF), etc.; (6) detoxified mutants of abacterial ADP-ribosylating toxin such 
as a cholera toxin (CI), a pertussis toxin (PT), or an E, coli heat-labile toxin (LT), 
particularly LT-K63 (where lysine is substituted for the wild-type amino acid at 
position 63) LT-R72 (where arginine is substituted for the wild-type amino acid at 

30 position 72), CT-S109 (where serine is substituted for the wild-type amino acid at 
position 109), and PT-K9/G129 (where lysine is substituted for the wild-type amino 
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acid at position 9 and glycine substituted at position 129) (see, e.g.. International 
PubUcation Nos. W093/13202 and W092/19265); (7) MPL or 3-0-deacylated MPL 
(3dMPL) (see, e.g., GB 2220221), EP-A-0689454, optionally in the substantial 
absence of alum when used with pneumococcal saccharides (see, e.g.. International 

5 Publication No. WO 00/56358); (8) combinations of 3dMPL with, for example, 
QS21 and/or oil-in-water emulsions (see, e.g., EP-A-0835318, EP-A-0735898, EP- 
A-0761231; (9) oligonucleotides comprismg CpGmotife (see, e.g., Roman et al. 
(1997) Nat. Med. 3:849-854; Werner et al. (1997) Proc. Natl. Acad. Sci. USA 
94:10833-10837; Davis et al. (1998) J. Immunol. 160-.870-876; Chu etal. (1997) /. 

10 Exp. Med. 186: 1623- 1631; Lipford et al. (1997) Eur. J. Immunol. 27:2340-2344; 
Moldoveanu et al. (1988) Vaccine M:1216-1224; Krieg et al. (1995) Nature 
174:546-549; Klimnan et al. (1996) Proc. Natl. Acad Sci. USA 23:2879-2883; Ballas 
et aL (1996) /. Immunol 152:1840-1845; Cowdery et al. (1996) J. Immunol. 
15^:4570-4575; Halpem et al. (1996) Cell Immunol. l^.72'7S; Yamamoto et al. 

15 (1988) Jpn. J. Cancer Res. 22:866-873; Stacey et aL (1996) J. Immunol. 152:21 16- 
2122; Messina et al. (1991) J Immunol 1£2::1759-1764; Yi et al. (1996) J. Immunol 
152:4918-4925; Yi et al. (1996)7. Immunol 152:5394-5402; Yi et al. (1998) J. 
Immunol 1^:4755-4761; Yi et al. (1998)7. Immunol 160:5898-5906; International 
PubUcation Nos. WO 96/02555, WO 98/16247, WO 98/18810, WO 98/40100. WO 

20 98/55495, WO 98/379 19 and WO 98/52581), such as those containing at least on CG 
dinucleotide, with cytosine optionally replaced with 5-methylcytosine; (10) a 
polyoxyethylene ether or apolyoxyethylaie ester (see, e.g.. International Publication 
No. WO 99/52549); (11) a polyoxyethylene sorbitan ester surfactant in combination 
with an octoxynol (see, e.g.. International Publication No. WO 01/21207) or a 

25 polyoxyethylene alkyl ether or ester surfactant in combination with at least one 
additional non-ionic surfactant such as an octoxynol (see, e.g.. International 
Publication No. WO 01/21152); (12) a s^onin and an immunostimulatory 
oligonucleotide such as a CpG oligonucleotide (see, e.g., Litemational Publication 
No. WO 00/62800); (13) an immunostimulant and a particle of metal salt (see, e.g., 

30 International Publication No. WO 00/23 105); and (14) other substances that act as 
immunostunulating agents to enhance the effectiveness of the composition. 
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Muramyl peptides include, but are not limited to, N-acetyl-m\iramyl-L- 
threonyl-D-isoglutamine (thr-MDP), N-acteyl-nonnuramyl-L-alanyl-D-isogluatme 
(nor-MDP), -acetylniuramyl-L-alanyl-D-isogluatminyl-I^danine-2-(l-2- 
dipalmitoyl-j/z-glyceix)-3-huydroxyphosphoryloxy)-eth^^ (MTP-PE), etc. 
5 Moreover, the HCV proteins can be adsorbed to, or entrapped within, an 

ISCOM, as described above. Additionally, ISCOMs with adsorbed HCV core 
proteins, either the entire core region or a jfragment of HCV core protein, may be 
added to the formulations. Most preferably, the HCV core protein is a fragment 
comprising a polypeptide from the region spanning amino acid positions 121-135. 

10 See, e.g., Intemational Publication No. WO 01/37869A. 

As explained above, tiie composition may also contain immunostimulatory 
molecules, either in addition to or in place of the antigen delivery system. ' 
Immunostimulatory agents for use herein include, without limitation, 
monophosphorylipid A (MPL), trehalose dimycolate (TDM), and cell wall skeleton 

15 (CWS), preferably MPL + CWS (DetoxTW). MPL may be formulated into an 

emulsion to enhance its immunostimulatory affect. See, e.g., Uhich et al., "MPLr 
immunostimulat: adjuvant formulations." m Vaccine Adjuvants: Prepartion Methods 
and Research Protocols (O'Hagan DT, ed.) Human Press Inc., NJ (2000) pp. 273- 
282. MPL has been shown to induce the synthesis and release of cytokines, 

20 particularly IL-2 and IFN-y. Other useful immunostimulatory molecules include 

LPS and immunostimulatory nucleic acid sequences (ISS), including but not limited 
to, uiunethylated CpG motife, such as CpG oligonucleotides. 

Oligonucleotides containing unmethylated CpG motifs have been shown to 
induce activation of B cells, NK cells and antigeu-presenting cells (APCs), such as 

25 monocytes and macrophages. See, e.g., U.S. Patent No. 6,207,646. Thus, adjuvants 
derived from the CpG family of molecules, CpG dinucleotides and synthetic 
oligonucleotides which comprise CpG motifs (see, e.g., Krieg et al. Nature (1995) 
374:546 and Davis et al. J, Immunol. (1998) 160:870-876) such as any of the various 
immunostimulatory CpG oligonucleotides disclosed in U.S. Patent No. 6,207,646, 

30 may be used in the subject methods and compositions. Such CpG oligonucleotides 
generally comprise at least 8 up to about 100 basq)airs, preferably 8 to 40 basepairs, 
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more preferably 15-35 basepairs, preferably 15-25 basepairs, and any number of 
basepairs between these values. For example, oligonucleotides comprising the 
consensus CpG motif, represented by the formula 5*-XiCGX2-3*, where and Xj are 
nucleotides and C is unmethylated, will find use as immxmostimulatory CpG 
5 molecules. Generally,X, is A,GorT,andX2is CorT. Other useful CpG 

molecules include those captured by the formula 5'-XiX2CGX3X4, where X, and X2 
are a sequence such as GpT, GpG, GpA, ApA, ApT, ApG, CpT, CpA, CpG, TpA, 
TpT or TpG, and X3 and X4 are TpT, CpT, ApT, ApG, CpG, TpC, ApC, CpC, TpA, 
ApA, GpT, CpA, or TpG, wherein "p" signifies a phosphate bond. Preforably, the 

10 oligonucleotides do not include a GCG sequence at or near the 5 - and/or 3' terminus. 
Additionally, the CpG is preferably flanked on its 5 -end with two purines (preferably 
a Gp A dinucleotide) or with a purine and a pyrimidine (preferably, GpT), and 
flanked on its 3'-end with two pyrimidines, preferably a TpT or TpC dinucleotide. 
Thus, preferred molecules will con5)rise the sequence GACGTT, GACGTC, 

15 GTCGTT or GTCGCT, and these sequences will be flanked by several additional 
nucleotides. The nucleotides outside of this central core area appear to be extremely 
amendable to change. 

Moreover, the CpG oligonucleotides for use herein may be double- or single- 
stranded. Double-stranded molecules are more stable in vivo while single-stianded 

20 molecules display enhanced immune activity. Additionally, the phosphate backbone 
may be modified, such as phosphorodithioate-modified, in order to enhance the 
immunostimulatory activity of the CpG molecule. As described in U.S. Patent No. 
6,207,646, CpG molecules with phosphorothioate backbones preferentially activate 
B-cells, while those having phosphodi^er backbones preferentially activate 

25 monocytic (macrophages, dendritic cells and monocytes) and NK cells. 

One exemplary CpG oligonucleotide for use in the present compositions has 
the sequence 5'-TCCATGACGTTCCTGACGTT-3' (SEQ ID NO:6). 

CpG molecules can readily be tested for their ability to stunulate an immune 
response using standard techniques, well known in the art. For example, the ability 

30 of the molecule to stimulate a humoral and/or cellular immune response is readily 
determined using the immunoassays described above. Moreover, the antigen and 
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adjuvant compositions can be administered with and without the CpG molecule to 
detennine whether an immune response is enhanced. 

The HCV proteins may also be enc^sulated, adsorbed to, or associated with, 
particulate earners, as described above with reference to the HCV polynucleotides. 
5 As explained above, examples of particulate carriers include those derived from 
polymethyl methacrylate polymers, as well as microparticles derived from 
poly(lactides) andpolyOactide-co-glycolides), known as PLG. See, e.g., Jefifery et 
al., Pharm. Res, (1993) lfi:362-368; and McGee et al., J, Microencap. (1996). One 
preferred method for adsorbing macromolecules onto prepared microparticles is 
10 described above and in International Publication No. WO 00/050006. 

Methods of Producing HCV-Specific Antibodies 

The HCV proteins can be used to produce HCV-specific polyclonal and 
monoclonal antibodies. HCV-specLSc polyclonal and monoclonal antibodies 

15 specifically bind to HCV antigens. Polyclonal antibodies can be produced by 

administering the fusion protein to a mammal, such as a mouse, a rabbit, a goat, or a 
horse. Serum from the immuruLzed animal is collected and the antibodies are purified 
from the plasma by, for example, precipitation with ammonium sulfate, followed by 
chromatography, preferably aflHnity chromatography Techniques for producing and 

20 processing polyclonal antisera are known in the art. 

Monoclonal antibodies directed against HCV-specific epitopes present in the 
proteins can also be readily produced. Normal B cells from a mammal, such as a 
mouse, immunized with an HCV protein, can be fused with, for example, HAT- 
sensitive mouse myeloma cells to produce hybridomas. Hybridomas producing 

25 HCV-specific antibodies can be identified using RIA or ELISA and isolated by 

cloning in semi-solid agar or by limiting dilution. Clones producing HCV-specific 
antibodies are isolated by another round of screening. 

Antibodies, either monoclonal and polyclonal, which are directed against 
HCV epitopes, are particularly useful for detecting the presence of HCV or HCV 

30 antigens in a sample, such as a serum sample from an HCV-infected human. An 

immunoassay for an HCV antigen may utilize one antibody or several aatibodies. An 
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immunoassay for an HCV antigen may use, for example, a monoclonal antibody 
dii^ected towards an HCV epitope, a combination of monoclonal antibodies directed 
towards epitopes of one HCV polypeptide, monoclonal antibodies directed towards 
epitopes of different HCV polypeptides, polyclonal antibodies directed towards the 

S same HCV antigen, polyclonal antibodies directed towards different HCV antigens, 
or a combination of monoclonal and polyclonal antibodies. Immunoassay protocols 
may be based, for example, \spon competition, direct reaction, or sandwich type 
assays using, for example, labeled antibody. The labels may be, for example, 
fluorescent, chemilununescent, or radioactive. 

10 The polyclonal or monoclonal antibodies may further be used to isolate HCV 

particles or antigens by immunoafEnity columns. The antibodies can be affixed to a 
solid support by, for example, adsorption or by covalent linkage so that the 
antibodies retain their immunoselective activity. Optionally, spacer groups may be 
included so that the antigen binding site of the antibody remains accessible. The 

1 5 immobilized antibodies can then be used to bind HCV particles or antigens from a 
biological sample, such as blood or plasma. The bound HCV particles or antigens 
are recovered from the column matrix by, for example, a change in pH. 

HCV'Specific T cells 

20 HCV-specific T cells that are activated by the above-described fusions and 

E1E2 complexes, iacluding tiie NS3NS4NS5a fusion protein or NS3NS4NS5aNS5b 
fusion protein, and the E1E2 complexes, expressed in vivo or in vitro, preferably 
recognize an epitope of an HCV polypeptide such as an El, E2, NS3, NS4, NS5a, 
NS5b polypeptide, including an epitope of an NS3NS4NS5a fusion protein or an 

25 NS3NS4NS5aNS5b fusion protein, or an E1E2 complex. HCV-specific T cells can 
be CDS"^ or CD4^ 

HCV-specific CD8^ T cells preferably are cytotoxic T lymphocytes (CTL) 
which can kill HCV-infected cells tiiat display El, E2, NS3, NS4, NSSa, NS5b 
epitopes complexed with an MHC class I molecule. HCV-specific CDS'** T cells may 
30 also express interferon-y (IFN-^). HCV-specific CDS"" T cells can be detected by, for 
example, ^*Cr release assays (see the examples). ^^Cr release assays measure the 



44 



wo 2004/039950 



PCT/US2003/033610 



ability of HCV-specific CDS'" T cells to lyse target cells displaying an El, E2, E1E2, 
NS3, NS4, NS5a, NS5b, NS3NS4NS5a, orNS3NS4NS5aNS5b epitope. HCV- 
specific CD8^ T cells which express IFN-7 can also be detected by immunological 
methods, preferably by intracellular staming for IFN-y after in vitro stimulation with 
5 an El, E2, NS3, an NS4, an NS5a, or an NS5b polypeptide (see the examples). 

HCV-specific CD4^ cells activated by the above-described E1E2 complexes 
and fusions, such as an El polypeptide, an E2 polypeptide, an E1E2 complex, 
NS3NS4NS5a or NS3NS4NS5aNS5b fusion protein, expressed in vivo or in vitro, 
preferably recognize an epitope of an El, E2, NS3, NS4, NS5a, or NS5b polypeptide, 

10 including an epitope of an E1E2 complex, NS3NS4NS5a or NS3NS4NS5aNS5b 
fusion protein, that is bound to an MHC class n molecule on an HCV-infected cell 
and proliferate in response to stimulating E1E2 complexes with NS3NS4NS5a or 
NS3NS4NS5aNS5b peptides, with or without a core polypeptide. 

HCV-specific CD4"^ T cells can be detected by a lymphoproliferation assay 

15 (see examples). Lymphoproliferation assays measure the ability of HCV-specific 
CD4^ T cells to proliferate in response to an El, E2, NS3, an NS4, an NS5a, or an 
NS5b epitope. 

Methods of Activating HCVSpeciflcT Cells. 

20 The HC V proteins or polynucleotides can be used to stimulate an immune 

response, such as to activate HCV -specific T cells either in vitro or in vivo. 
Activation of HCV-specific T cells can be used, inter alia, to provide model systems 
to optimize CTL responses to HCV and to provide prophylactic or therq)eutic 
treatment against HCV infection. For in vitro activation, proteins are preferably 

25 supplied to T cells via a plasmid or a viral vector, such as an adenovirus vector, as 
described above. 

Polyclonal populations of T cells can be derived from the blood, and 
preferably 

firom peripheral lymphoid organs, such as lymph nodes, spleen, or thymus, of 
30 mammals that have been infected with an HCV. Preferred mammals include mice, 
chimpanzees, baboons, and humans. The HCV serves to expand the number of 
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activated HCV-specijBc T cells in the mammal. The HCV-specific T cells derived 
from the mammal can then be restimulated in vitro by adding, e.g., HCV E1E2 and 
NS3NS4NS5a or NS3NS4NS5aNS5b epitopic peptides, with or without a core 
polypeptide, to the T cells. The HCV-specific T cells can then be tested for, inter 
5 alia, proliferation, the production of IFN-7, and the ability to lyse target cells 
displaying E1E2, NS3NS4NS5aor NS3NS4NS5aNS5b epitopes in vitro. 

In a lymphoproliferation assay (see examples), HCV-activated CD4* T cells 
proliferate when cultured with an NS3, NS4, NSSa, NS5b, NS3NS4NS5a, or 
NS3NS4NS5aNS5b epitopic peptide, but not in the absence of an epitopic peptide. 
10 Thus, particular El, E2, NS3, NS4, NSSa, NS5b. NS3NS4NS5a and 

NS3NS4NS5aNS5b epitopes that are recognized by HCV-specific CD4* T cells can 
be identified using a lymphoproliferation assay. 

Similarly, detection of IFN-7 in HCV-specific CD8+ T cells after in vitro 
stimulation with the above-described HCV proteins, can be used to identify El, E2, 
15 E1E2, NS3, NS4, NSSa, NSSb, NS3NS4NSSa, and NS3NS4NSSaNSSb epitopes that 
particularly effective at stimulating CD8^ T cells to produce IFN-y (see examples). 

Further, ^^Cr release assays are usefid for determining the level of CTL 
response to HCV. See Cooper etal. Immunity 10:439-449. For example, HCV- 
specific CD8^ T cells can be derived from the liver of an HCV infected mammal. 
20 These T cells can be tested in ^^Cr release assays against target cells displaying, e.g., 
E1E2, NS3NS4NSSa and/or NS3NS4NSSaNSSb epitopes. Several target cell 
populations e5q)ressing different E1E2, NS3NS4NSSa and/or NS3NS4NSSaNS5b 
epitopes can be constructed so that each target cell population displays different 
epitopes of E1E2, NS3NS4NSSa and/or NS3NS4NSSaNSSb. The HCV-specific 
2S CDS* cells can be assayed against each of these target cell populations. The results 
of the ^^Cr release assays can be used to determine which epitopes of E1E2, 
NS3NS4NSSa and/or NS3NS4NSSaNSSb are responsible for the strongest CTL 
response to HCV. E1E2 complexes, NS3NS4NSSa fusion proteins or 
NS3NS4NSSaNSSb fixsion proteins, with or without core polypeptides, which 
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contain the epitopes responsible for the strongest CTL response can then be 
constructed using the mfonnation derived from the ^*Cr release assays. 

HC V proteins as described above, or polynucleotides encoding such proteins, 
can be administercd to a mammal, such as a mouse, baboon, chimpan23ee, or human, 
5 to stimulate an immune response, such as to activate HCV-specific T cells in vivo. 
Administration can be by any means known in the art, including paroateral, 
intranasal, intramuscular or subcutaneous injection, including injection using a 
biological ballistic gun ("gene gun"), as discussed above. 

Preferably, injection of an HCV polynucleotide is used to activate T cells. In 

1 0 addition to the practical advantages of simplicity of construction and modification, 
injection of the polynucleotides results in the synthesis of a fusion protein in the host. 
Thus, these immunogens are presented to the host mnnune system with native post- 
translational modifications, structure, and conformation* The polynucleotides are 
preferably injected intramuscularly to a large mammal, such as a human, at a dose of 

15 0.5, 0.75, 1 .0, L5, 2.0, 2.5, 5 or 10 mg/kg. 

A composition of the invention comprising the HCV proteins or 
polynucleotides is administered in a manner compatible with the particular 
composition used and ia an amount which is effective to stimulate an immune 
response, such as to activate HCV-specific T cells as measured by, inter alia, a ^^Cr 

20 release assay, a lymphoproliferation assay, or by intracellular staining for IFN-y. The 
proteins and/or polynucleotides can be administered either to a mammal which is not 
infected with an HCV or can be administered to an HC V-infected manunal. The 
particular dosages of the polynucleotides or proteins in a composition will depend on 
many factors including, but not lunited to the species, age, and general condition of 

25 the mammal to which the composition is administered, and the mode of 

administration of the compositioa An effective amount of the composition of the 
invention can be readily determined using only routine experimentation. Jn vitro and 
in vivo models described above can be employed to identify ^propriate doses. The 
amount of polynucleotide used in the example described below provides general 

30 guidance which can be used to optimize the activation of HCV-specific T cells either 
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in vivo or in vitro. Generally, 0.5, 0.75, 1.0, 1.5, 2.0, 2.5, 5 or 10 mg of anHCV 
fusion and El and E2 polypeptides, such as an E1E2 complex, an NS3NS4NS5a or 
NS3NS4NS5aNS5b fusion protein or polynucleotide, with or without a core 
polypq)tide, will be administered to a large mammal, such as a baboon, chimpanzee, 
5 or human. If desired, co-stimulatory molecules or adjuvants can also be provided 
before, after, or together with the compositions. 

Immune responses of the mammal generated by the delivery of a composition 
of the invention, including activation of HCV-specific T cells, can be enhanced by 
varying the dosage, route of administration, or boosting regimens. Compositions of 
10 the invention may be given in a single dose schedule, or preferably in a multiple dose 
schedule in which a primary course of vaccination includes 1-10 separate doses, 
followed by other doses given at subsequent time intervals required to maintain 
and/or reenforce an immune response, for example, at 1-4 months for a second dose, 
and if needed, a subsequent dose or doses after several months. 

15 

Deposits of Strains Useful in Practicing the Invention 

A deposit of biologically pure cultures of the following strains was made with 
the American Type Culture Collection, 10801 University Boulevard, Manassas, VA. 
The accession number indicated was assigned after successful viability testing, and 

20 the requisite fees were paid, made under the provisions of the Budapest Treaty on the 
International Recognition of the Deposit of Microorganisms for the Purpose of Patent 
Procedure and the Regulations thereunder (Budapest Treaty). This assures 
maintenance of viable cultures for a period of thirty (30) years from the date of 
deposit. Tlie organisms will be made available by the ATCC under the terms of the 

25 Budapest Treaty, which assures permanent and unrestricted availability of the 

progeny to one determined by the U.S. Comnaissioner of Patents and Trademarks to 
be entitled thereto according to 35 U.S.C. §122 and the Commissioner's rules 
pursuant thereto (including 37 C.F.R. §1.12 with particular reference to 886 OG 
63 8). Upon the granting of a patent, all restrictions on the availability to the public of 

30 the deposited cultures will be irrevocably removed. 
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These deposits are provided merely as convenience to those of skill in the art, 
and are not an admission that a deposit is required under 35 U.S.C. §112. The 
nucleic acid sequences of these genes, as well as the amino acid sequences of the 
molecules encoded thereby, are incorporated herein by reference and are controlling 
5 in the event of any conflict with the description herein. A license may be required to 
make, use, or sell the deposited materials, and no such licoise is hereby granted. 

Plasmid Deposit Date ATCC No. 

10 E1E2-809 August 16, 2001 PTA-3643 



in. Experimental 

15 Below are examples of specific embodiments for canying out the present 

invention. The examples are offered for illustrative purposes only, and are not 
intended to limit the scope of the present invention in any way. Those of skill in the 
art will readily appreciate that the invention may be practiced in a variety of ways 
given the teaching of this disclosure. 

20 Efforts have been made to ensure accuracy with respect to numbers used (e.g., 

amoimts, temperatures, etc.), but some experimental error and deviation should, of 
course, be allowed for. 

EXAMPLE 1 

25 Production of NS3NS4NS5a Polynucleotides. 

A polynucleotide ^coding NS3NS4NS5a (approximately amino acids 1027 
to 2399, numbered relative to HCV-1) (also termed •'NS345a" herein) or NS5a 
(approximately amino acids 1973 to 2399, numbered relative to HCV-1) was isolated 
from an HCV. Polynucleotides encoding a methionine residue were ligated to the 5' 

30 end of these polynucleotides and the polynucleotides were cloned into plasmid, 
vaccinia virus, and adenovirus vectors. 
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Immunization Protocols. In one immunization protocol, mice were 
immunized with 50 fig of plasmid DNA encoding either NS5a or encoding an 
NS3NS4NS5a fusion protein by intramuscular injection into the tibialis anterior. A 
booster injection of 10'' pfo of vaccinia virus (W)-NS5a (intr^eritoneal) or 50 p.g of 

5 plasmid control (intramuscular) was provided 6 weeks later. 

In another immunization protocol, mice were injected intramuscularly in the 
tibialis anterior with adenovirus particles encoding an NS3NS4NS5a fusion 
protein. An intrq)eritoneal booster injection of 1 0^ pfii of W-NS5a or an 
intramuscular booster injection of 10*** adenovirus particles encoding NS3NS4NS5a 

1 0 was provided 6 weeks later. 

EXAMPLE 2 

Immunization with DNA encoding an NS3NS4NS5a fusion protein activates 
HCV-specific CD8^ T cells. 

15 ^^Cr Release Assay. A ^*Cr release assay was used to measure the ability of 

HCV-specific T cells to lyse target cells displaying an NS5a epitope. Spleen cells 
were pooled from the immunized animals. These cells were restibmulated in vitro for 
6 days with the CTL epitopic peptide p214K9 (2152-HEYPVGSQI^2160; SEQ ID 
NO: 1) from HCV-NS5a in the presence of IL-2. The spleen cells were then assayed 

20 for cytotoxic activity in a standard ^^Cr release assay against peptide-sensitized target 
cells (L929) expressing class I, but not class n MHC molecules, as described in 
Weiss (1980) J. Biol Chem. 255:9912-9917. Ratios of eflfector (T cells) to target (B 
cells) of 60:1, 20:1, and 7:1 were tested. Percent specific lysis was calculated for 
each effector to target ratio. 

25 . The results of the assays are shown in Tables 1 and 2. Table 1 demonstrates 

that immunization with plasmid DNA encoding an NS3NS4NS5a fusion protdn 
activates CDS* T cells which recognize and lyse target cells displaying an NS5a 
epitope. Surprisingly the NS5a polypeptide of the NS3NS4NS5a fusion protein was 
able to activate T cells even though the NS5a polypeptide was present in a fusion 

30 protein. 
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Similarly, Table 2 demonstrates that delivery of the NS3NS4NS5a fusion 
protein to mice by means of an adenovirus vector also activates CD8"^ T cells which 
recognize and lyse target cells displaying an HCV NS5a epitope. Thus, 
immunization with either "naked" (plasmid) DNA encoding an NS3NS4NS5a fusion 
5 pi-otem or adenovirus vector-encoded fiision protein can be used to activate HCV- 
specific T cells. 



EXAMPLES 

Immunization with DNA encodmg an NS3NS4NS5a fusion protein activates 

10 HCV-specific CDS-" T cells which express IFN-y. 

Intracellular Staining for Interferon-gamma (IFN-y), Intracellular staining 
for IFN-7 was used to identify the CDS"^ T cells that secrete IFN-y after in vitro 
stimulation with the NS5a epitope p214K9. Spleen ceUs of individual immunized 
mice were restimulated in vitro either with p2 14K9 or with a non-specific peptide for 

15 6-12 hours in the presence of IL-2 and monmsm. The cells were then stained for 
surface CDS and for intracellular IFN-7 analyzed by flow cytometry. The 
percent of CD8^ T cells which were also positive for IFN-y was then calculated. 
The results of these assays are shown in Tables 1 and 2, Table 1 demonstrates that 
CD8^ f cells activated in response to immunization with plasmid DNA encoding an 

20 NS3NS4NS5a fusion protein also express IFN-y. Immunization with an 

NS3NS4NS5a fusion protein encoded in an adenovirus also results in CD8^ HCV- 
specific T cells which express IFN-y, although to a lesser extent than immunization 
with a plasmid-encoded NS3NS4NS5a fusion protein (Table 2). 
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EXAMPLE 4 

Immunization with DNA encoding an NS3NS4NS5a fusion protein 
stimulates proliferation of HCV-specific CD4* T cells. 

Lymphoproliferation assay. Spleen cells from pooled immunized mice were 
5 depleted of CD8* T cells using magnetic beads and were cultured in triplicate with 
either p222D, an NS5a-epitopic peptide from HCV-NS5a (2224- 
AELffiANLLWRQEMG.2238; SEQ ID N0:2), or in medium alone. After 72 hours, 
cells were pulsed with Iji Ci per weU of ^H-thymidine and harvested 6-8 hours later. 
Incorporation of radioactivity was measured after harvesting. The mean cpm was 
10 calculated. 

As shown in Table 3, immunization with a plasmid-encoded NS3NS4NS5a 
fiision protein stimulates proliferation of CD4+ HCV-specific T cells. Immunization 
with an adenovirus vector encoding the fiision protem also resulted in stimulated 
proliferation of CD4-*' HCV-specific T cells (Table 4). 

15 



Table 3. HCV-NS5a-Specific CD4+ T CeUs in Mice 
Immunized with NS5a or NS345a DNA 


MeanCaPM 


NSSaDNA 


NS345aDNA 


p222D 


media 


p222D 


media 


4523 


740 


4562 


861 


(x6.1) 


(x5.3) 



p222D is a CD4+ epitopic peptide (aa: 2224-AELIEANLLWRQEMG-2238, SEO ID 
25 NO:2)fiomHCV-NS5a 
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Table 4. HCV-NS5-Specific CD4+ T CeUs Primed by 
Adenovirus or DNA Sicoding for NS345a 


Mean CPM 


NS345aAdeno 


NS345aDNA 


p222D 


media 


p222D 


media 


896 


357 


1510 


385 


(x2.5) 


(x3.9) 



p222D is a CD4+ epitopic peptide (aa: 2224-AELIEANLLWRQEMG-2238, SEQ ID 
10 NO:2)firomHCV-NS5a 

EXAMPLES 

Efficiency of NS345a-encoding DNA Vaccine Formulations to prime CTLs . 
in mice. 

1 5 Mice were immunized with either 1 0-1 00 ]ig of plasmid DNA encoding 

NS345a fusion protein as described in Example 1, with PLG-linked DNA encoding 
NS345a, described below, or with DNA encoding NS345a, delivered via 
electroporation (see, e.g., U.S. Patent Nos. 6,132,419; 6,451,002, 6,418,341, 
6233,483, U.S. Patent Publication No. 2002/0146831; and Ihtemational Publication 

20 No. WO/0045823, for this delivery technique). The immunizations were followed by 
a booster injection 6 weeks later of 1 x lO^pfii vaccinia virus encoding NS5a, 
plasmid DNA encoding NS345a or plasmid DNA encoding NS5a each as described 
in Example 1. 

PLG-delivered DNA. The polylactide-co-glycoUde (PLG) polymers were 
25 obtained from Boehringer Ingelheim, U.S A. The PLG polymer used in this study 
was RG505, which has a copolymer ratio of 50/50 and a molecular weight of 65 IcDa 
(manufacturers data). Cationic microparticles with adsorbed DNA were prepared 
using a modified solvent evaporation process, essentially as described in Singh et al.. 
Proa Natl Acad Set USA (2000) 97:811-816. Briefly, the microparticles were 
30 prepared by emulsifying 1 0 ml of a 5% w/v polymer solution in methylene chloride 
with 1 ml of PBS at high speed using an KA homogenizer. The primary emulsion 
was then added to 50ml of distilled water containing cetyl trimethyl ammonium 
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bromide (CTAB) (0.5% w/v). This resulted in the formation of a w/o/w emulsion 
which was stirred at 6000 rpm for 12 hours at room temperature, allowing ttie 
methylene chloride to evaporate. The resulting microparticles were washed twice in 
distilled water by centrifugation at 10,000 g and Jieeze dried. Following preparation, 
5 washing and collection, DNA was adsorbed onto the microparticles by incubating 
100 mg of cationic microparticles in a Img/ml solution of DNA at 4 C for 6 hours. 
The microparticles were then separated by centrifugation, the pellet washed with TE 
buffer and the microparticles were freeze dried. 

CTL activity and IFN-y expression were measured by ^^Cr release assay or 
10 intracellular staining as described in examples 2 and 3 respectively. The results are 
shown in Table 5. 

Results demonstrate that immunization using plasmid DNA encoding for 
NS345a to prime mice results in activation of CD8+ HCV specific T cells. 
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Table 5: Efficiency of NS345a-Encoding DNA Vaccine Formulations to Prime CTLs in 

Mice 








ICSforlFN-gamma 
(%CD8+ cells that are IFN- 
g+) 








NS345a 

DNA 

Vaccines 


Boost 


Mean 


SdtdevP 


#of 
mice 
tested 


% 
respon 

d- • 
ing 


#of 
expts 


fold 
increase 

vs. 
'naked' 

DNA 


CTL 
actlv 
i- 
ty? 


NS345a 
DNA 


YVNS5a 


1.02 


1.70 


41 


68% 


10 


N/A 


YES 


NS345a 
DNA 


NS345a 
DNA 


0.02 


0.04 


22 


5% 


5 


N/A 


YES 


NS345a 
DNA 


NS5a 
DNA 


0.22 


0.21 


24 


63% 


5 


N/A 


YES 


NS345a 
DNAeV 
(electro- 
poration) 


VVNS5a 


5.00 


4.36 


7 


100% 


2 


4.90 


YES 


PLGNS34 
5a DNA 


VVNS5a 


2.65 


2.54 


6 


100% 


2 


2.60 


YES 


PLGNS34 
5a DNA 


NS5a 
DNA 


0.33 


0.24 


15 


80% 


3 


1.50 


YES 



EXAMPLE 6 

25 Immunization routes and replicon particles SINCR (DC-^J encoding for NS345a 

Alphavirus replicon particles, for example, SINCR (DC+) were prepared as 
described in Polo et al., Proc. Natl Acad, ScL USA (1999) 96:4598-4603. Mice were 
injected with 5 x 10^ lU SINCR (DC+) replicon particles encoding for NS345a 
intramuscularly (IM) as described in Example 1, or subcutaneously (S/C) at tiie base of the 

30 tail (Bol) and foot pad (FP), or with a combination of 2/3 of the DNA delivered via IM 
administration and 1/3 via a BoT route. The immunizations were followed by a booster 
injection of vaccinia virus encoding NS5a as described in Example 1. 

IFN-y expression was measured by intracellular staining as described in Example 3. 
The results are shown in Table 6. The results demonstrate that immxmization via SINCR 

35 (DC+) replicon particles encoding for NS345a by a variety of routes results in CD8+ HCV 
specific T cells which express DFN-y. 
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% respond] 
mice 


100% 


100% 


100% 














NS345a 


ICSformsr-gamma 
;%CD8+ cells that are IFN-g+) 




t-H 






les Encoding 


# of mice, 
tested 






cn 


able 6: TmmuniTation Routes and SINCR (DC+) Replicon Partic 

(all mice WNS5a challenged) 


SdtdevP 


0.63 


0.29 


2.00 




Mean 


1.11 


0.62 


2.43 




Immunization Route 


100%IM(ta) 


H 
o 

s 

CO 
so 

O 
O 


2/3IM(ta)+l/3S/C(BoT) 


H 






o 










Vaccines 


SINCR(DC+)5: 


SiNCR(DC+)5: 


SINCR(DC+)5: 
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EXAMPLE? 

SINCR (DC+) vs SINDC (LP) replicon particles encoding for NS345a 
Alphavinis replicon particles, for example, SINCR (DC4-) and SINCR (LP) were 
prepared as described in Polo et al., Proc, Natl Acad. ScU USA (1999) 26:4598-4603. Mice 
5 were immunized >vith 1 x 1 0^ to 1 x 10' lU of SINCR (DC+) or SINCR (LP) repUcon 

particles encoding for NS345a, by intramuscular injection into the tibialis anterior, followed 
by a booster injection of 10' pfii vaccinia virus encoding NS5a at 6 weeks. 

IFN-Y expression was measured by intracellular staining as described in Example 3. 
Administration of an increase in the number of SINCR (DC+) replicon particles encoding 
10 NS345a resulted in an increase in % of CD8+ T cells expressing DFN-y. 

EXAMPLES 

Alphavirus replicon priming, followed by various boosting regimes. 
Alphavinis replicon particles, for example, SINCR (DC+) were prepared as 
15 described in Polo et al., Proc. Natl Acad. Set USA (1999) 26:4598-4603. Mice were 
primed with SINCR (DC+), 1,5 x 10^ lU repUcon particles encoding NS345a, by 
intramuscular injection into the tibialis anterior, followed by a booster of either 10-100 )ig 
of plasmidDNA encoding for NS5a, 10^° adenovirus particles encoding NS345a, l,5x 10^ 
lU SINCR (DC+) replicon particles encoding NS345a, or 10' pfu vaccinia virus encoding 
NS5a at 6 weeks. 

EFN-y expression was measured by intracellular staining as described in Example 3. 
The results are shown in Table 7. The results demonstrate that boosting with vaccinia virus 
encoding NS5a DNA results in the strongest generation of CD8+ HCV specific T cells 
which express IFN^. Boosting with plasmid encoding NSSa DNA also results in a good 
response, while lesser responses are noted with adenovirus NS345a or SINCR DC+ boosted 
animals. 
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Table 7: Alphavirus Replicon Particle Priming, Followed by Various Boosting Regimens 






ICSforlFN-ganima 
(%CD8+ cells that are IFN-g+) 




Vaccines 


Boost 


Mean 


SdtdevP 


# of mice 
tested 


#of 
expts 


% 

respond- 
ing mice 


1.5X10* 


NbSa DNA 


0.46 


0.36 


4 


1 


75% 


SINCR(DC+) 
1.5X10* 


AdenoNS345a 
(lOXlO'") 


0.04 


0.04 


4 


1 


25% 


SINCR(DC+) 
1.5X10* 


SINCR(DC+) 
1.5X10* 


0.06 


0.06 


8 


2 


25% 


SINCR(DC+) 
1.5X10* 


WNS5a 
(1X10') 


2.43 


2.45 


4 


1 


100% 



15 

EXAMPLE9 

Alphaviruses expressing NS345a 

Alphavirus replicon particles, for example, SINCR (DC+) and SINCR (LP) 
were prepared as described in Polo et al., Proc. Natl Acad. Set USA (1999) 

20 96:4598-4603. Mice were inununized with 1 x 10^ to 1 x 10^ lU SINCR (DC+) 
replicons encoding NS345a via a combination of delivery routes (2/3 IM and 1/3 
S/C) as weU as by S/C alone, or with 1 x 10^ to 1 x 10^ lU SINCR (LP) replicon 
particles encoding NS345a via a combination of delivery routes (2/3 IM and 1/3 S/C) 
as well as by S/C alone. 

25 The immunizations were followed by a booster injection of lO'' pfti vaccinia virus 
encoding NS5a at 6 weeks. 

IFN-y expression was measured by intracellular staining as described in 
Example 3. The results are shown in Figure 6. The results indicate activation of 
CD8+ HCV specific T cells. 
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EXAMPLE 10 

Efficiency ofNSSa encoding DNA vaccine formulations to prime CTLs in 

mice 

Mice were immmiized with either 10-100 jxg of plasmid DNA encoding NS5a 
5 as described in Example 1 or with PLG-linked DNA encoding NS5a as described in 

Example 5. The immunizations were followed by a booster injection at 6 weeks of 

either 10-100 ng of plasmid DNA encoding for NS5a, 10^** adenovirus particles 

encoding NS345a, 1 .5 x 10^ lU SINCR (DC+) replicon particles encoding NS345a, 

or 10^ pfii vaccinia virus encoding NS5a. 
10 CTL activity and IFN-y expression were measured by the methods described 

in Examples 2 and 3. 

The results are shown in Table 8. The results demonstrate that priming with 

plasmid DNA encoding for NS5a or PLG-linked DNA encoding NS5a results in 

activation of CD8+ HCV specific T cells. 
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EXAMPLE 11 

Efficiency ofNS345b-encodingDNA vaccine formulations to prime CTLs in 

mice 

5 Mice were immimized with 10-100 (ig of plasmid DNA encoding NS34b by 

intramuscular injection to the tibialis anterior or with PLG linked DNA encoding 
NS5a as described in Example 5. The immunizations were followed by a booster 
injection of plasmid DNA encoding for NS5a as described in Example 1. 

CTL activity and IFN-y expression were measured by the methods described 
10 in Examples 2 and 3 , 

The results are shown in Table 9. The results demonstrate that priming with 
plasmid DNA encoding NS345b or PLG-linked NS345b results in activation of 
CD8+ HCV specific T cells. 
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Table 9: Efficiency of NS345b-Encoding DNA Vaccine Fonnulations to Prime CTLs in Mice 




CTL 
activity? 


YES 


YES 1 




fold increase 
vs. 'naked' 
DNA 


N/A 


1.67 




#of 
expts 




CO 


ICSforlFN-gamma 
(%CD8+ cells that are IFN-g+) 


responding 


%09 




# of mice 
tested 






SdtdevP 


0.16 


0.33 




Mean 


0.18 


0.30 




Boost 


NSSaDNA 


NSSaDNA 




NS345 DNA 
Vaccines 


NS345 DNA 


PLGNS345 DNA 
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EXAMPLE 12 

Administration of DMA via separate plasmids 

Mice were immimized with 100 (ig plasmid DNA encoding forNS345a or 
with 100 fig PLG-linked DNA encoding NS345a. Additionally, separate DNA 
5 plasmids encoding NS5a, NS34a, and NS4ab (33.3 jig each) were administered 
concurrently to another group of mice. Finally, PLG-linked DNA encoding NS5a, 
NS34a, and NS4ab (33.3 ^g each) were administered concurrently to anoflier group 
of mice. The immunizations were followed by a booster injection of IxlO'' pfu 
vaccinia virus encoding NS5a, 6 weeks post first immunization. 
10 IFN-Y expression was measured by the method described in Example 3. The 

results are shown in Figure 7. The results demonstrate a particularly vigorous 
response in the activation of CD8+ HCV specific T cells when the DNA is broken 
down into smaller sub units and linked to PLG. 

15 EXAMPLE 13 

Immunogenicity ofNS345Corej2rISCOMS in Mice 
Groups of 10 C57 black mice were immunized IM at 0, 21 and 60 days with 
the formulations shown in Table 10. The NS345Corei2rPLGdss group received a 
vaccine dose of 50 ^1 in each leg whereas the other vaccine groups received a vaccine 
20 dose of 50 jxl in one leg. 

NS345Corei2j-ISCOMS were comprised of amino acids 1242 to 3011 and 1- 
121 and the HCV polyprotein, numbered relative to HCV-land were adsorbed to 
ISCOMS with a ratio of protein to QH of approximately 8:1, using standard 
techniques. See, e.g., Intemational Publication No. WO 01/37869A. 
25 Core-ISCOMS including an HCV core pretein fragment firom the region 

spanning amino acid positions 1-191 of the HCV polyprotein, numbered relative to 
HCV-1, with a ratio of protein to QH of 1:1, were produced using standard 
techniques. See, e.g., Intemational PubUcation No. WO 01/37869 A. 

NS345Corei2, was formulated in 0. 1% SDS m PBS and contained DTT. 
30 Protein was diluted in PBS and mixed 1:1 with MF59 (see, Ott et al., "MF59 - 
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Design and Evaluation of a Safe and Potent Adjuvant for Human Vaccines" in 
Vaccine Design: The Submit and Adjuvant Approach (Powell, M.F. and Newman, 
MJ. eds.) Plenum Press, New York (1995) pp. 277-296; and U.S. Patent No. 
6,299,884) prior to immunization. 
5 For NS345Coiei2,-PLGdss, PLG microparticles produced as described above 

were treated with 3-(trimethylsilyl)-l-propanesulfomc acid (DSS) to enhance 
adsorption of antigen. DSS is commercially available from, e.g., Sigma Chemical 
Co., St. Louis, MO. NS345Corei2i was adsorbed thereto using standard techniques 
(see. International Publication No. WO 00/050006). The NS345Core,2rPLGdss was 

1 0 mixed with MF59 prior to immunization. 

As shown m Table 10, NS345Corem.ISCOMS produced antibody response 
only to NS5 m immunized C57 black mice. Higher levels of antibodies to NS5 were 
produced in mice immunized with NS345Coie,2i adjuvanted with MF59, however no 
antibody response to core, NS3 or NS4 was produced with this adjuvant either. 

15 Mice inmunized with CoreJSCOMS produced antibodies to core. In 

contrast, NS345Corei2i-PLGdss immunized mice produced significantly higher 
antibodies to NS5 thanNS345Core,2rISCOMS. In addition, NS345Corei2rPLGdss 
immunized mice produced antibodies to NS3 and some antibody response to core, 
but no antibodies to NS4. 
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Table 10. Immunogemcity of NS345Core,2,-Iscoms in Mice. Geometric mean EIA 
antibody titers to core and nonstructural proteins are shown. The number of 
responding mice per group are also listed. 


Vaccine 


IM Protein 


Anti-Core 


Anti-C33C 


Anti-ClOO 


Anti-NS5 




Dose (fig)' 


Antibody 


(NS3) 


(NS4) 


Antibody 






EIA GMT 


Antibody 
EIA GMT 


Antibody 
EIA GMT 


EIA GMT 


NS345Core 


6.0, 6.0, 6.0'' 


<10 


<10 


<10 


31 


nilSCOMS 




(0/10) 


(0/10) 


(0/10) 


(7/10.) 


Core- 


6.0,6.0,6.0" 


188 


<10 


<10 


<10 


ISCOMS 




(9/10) 


(1/10) 


(0/10) 


(2/10) 


NS345Core 


6.0, 6.0, 6.0 


<10 


<10 


<10 


279 


mMF59 




(2/9) 


(1/9) 


(0/9) 


(9/) 


NS345Cort 


10, 10, 10 


5 


50 


<10 


419 


mPLG- 




(6/10) 


(9/10) 


(2/10) 


(9/9) 


dss/MF59 













'Groups of 10 C57 black mice were immunized IM at 0, 21 and 60 days. Serum was 
obtained after the last immunization. TheNS345 CorejarPLGdss group received vaccine 
dose of 50 jil in each leg whereas the other vaccme groups received vaccine dose of 50 \il h 
20 one leg. 

''The ratio of protein to QH was approximately 8:1. 
"The ratio of protem to QH was approximately 1:1. 

EXAMPLE 14 

25 Immunogenicity of Different Formulations ofNS345Corej2i or NS345 in 

Mice 

Groups of 10 C57 black mice were immunized IM at 0, 30 and 60 days with 
the formulations shown in Tables 1 1 and 12. For the studies shown in Table 1 1, the 
NS345 and NS345Corei2i protein concentration was 10 |iig per dose, and for those in 
30 Table 12, the concentration was 5 |Lig per dose. 
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For PLG-NS345 (amino acids 1242 to 301 1 of the HCV polyprotein) and 
PLG-NS345Core,2, (amino acids 1242-3011 and 1-121 of flie HCV polyprotein), 
PLG microparticles were prepared and NS345 or NS345Coiei2i were adsorbed 
thereto using standard techniques, as desoibed above. 
5 For PLG-NS345 + PLG-CTAB-E1E2 DNA, PLG microparticles were 

prepared and NS345 was adsorbed to the microparticles as described above. E1E2 
DNA was produced as follows. Mammalian expression plasmid pMH-ElE2-809 
(Figure 4, ATCC Deposit No. PTA-3643) encodes an E1E2 fusion protein which 
includes amino acids 192-809 of HCV la (see, Choo et al, Proc, Natl Acad, Sci 

10 USA (1991) 88:245 1-2455). Chinese Hamster Ovary (CHO) cells were used for 
expression of the HCV E1E2 sequence from pMH-ElE2-809. In particular, CHO 
DG44 cells were used. These cells, described by Uraub et al., Proc. Natl Acad. Sci 
USA (1980) 22:4216-4220, were derived from CHO K-1 cells and were made 
dihydrofolate reductase (dhfr) deficient by virtue of a double deletion in the dhfr 

15 gene. DG44 cells were transfected with pMH-ElE2-809. The transfected cells were 
grown in selective medium such that only those cells expressing the dhfr gene could 
grow (Sambrook et al., supra). Isolated CHO colonies were picked (-800 colonies) 
into individual wells of a 96-well plate. From the original 96-well plates, replicates 
were made to perform expression experiments. The replicate plates were grown until 

20 the cells made a confluent monolayer. The cells were jSxed to the wells of the plate 
and permeablized using cold methanol. Anti-El and anti-E2 antibodies, 3D5C3 and 
3E5-1 respectively, were used to probe the Sxed cells. After adding an anti-mouse 
HRP conjugate, followed by substrate, the cell lines with the highest expression were 
determined. The highest expressing cell lines were then expanded to 24-well cluster 

25 plates. The assay for expression was repeated, and again, ttie highest expressing cell 
lines were expanded to wells of greater volume. This was repeated until the highest 
expressing cell lines were rapanded from 6-well plates into tissue culture flasks. At 
this point there was suflScient quantity of cells to allow accurate count and harvest of 
the cells, and quantitative expression assays were done. An ELISA was performed 

30 on the cell extract, to determine high expressors. 
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To produce the PLG-CTAB-E1E2 DNA, PLG microparticles were treated 
with CTAB as described above (see, Ihtemational Publication No. WO 00/050006). 

For PLG-NS345Corei2, + E1E2 DNA PLG-NS345Corei2i and E1E2 DNA 
were produced as described above. 
5 For PLG-NS345 or PLG-NS345Corei2i + MF59, PLG-NS345 or PLG- 

NS345Corei2i was combined with MF59 as described above. 

For PLG-NS345 or PLG-NS345Corei2i + CTAB-CpG, NS345 or 
NS345Corei2i was adsorbed to PLG as described above. The CpG molecule used 
was 5'-TCCATGACGTrCCTGACGTT-3* and this was treated with CTAB, as 
10 described above. 

For PLG-NS345 or PLG-NS345Corei2i + QS21, the saponin adjuvant QS21 
was combined with the PLG-HCV proteins. 

For PLG-NS345 or PLG-NS345Corei2i + CTAB-CpG + MF59, the various 
components, as described above, were combined. 
15 The remaining adjuvants used in the studies and shown in the tables are self- 

explanatory. 

The results of these studies are shown in Tables 1 1 and 12. As can be seen in 
Table 11, none of the formulations produced antibody responses to core, NS3 or NS4 
antigens. However , PLG-NS345+CTAB-CPG in MF59 produced the highest 

20 antibody titers to NS5. PLG-NS345Corei2i+QS21, PLG-NS345+CTAB-CPG, PLG- 
NS345Corei2,+CTAB-CPG, and PLG-NS345 + QS21 produced moderate antibody 
titers to NS5. The other formulations produced very low antibody titers to NS5. 

As can be seen in Table 12, NS345Corei2i /MF59/MPL and 
NS345Corei2i/MF59/CpG formulations produced very high antibody titers to 

25 NS345Corei2,. NS345Corei2i/MF59,NS345/MF59/CpG, and NS345Corei2i 

/MF59/Chol/QS21 formulations produced moderate antibody titers to NS345Core,2i. 
The other formulations produced very low or no antibody titCTS to NS345Corei2i. 
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Table 1 1. Immunogwiicity of different fonnulations of HCV NS345Core,2, or NS345 in 
Mice. Geometric mean EIA antibody titers to core and nonstructural proteins are shown. 


Vaccine" 


Anti-Core 
Antibody EIA 
GMT 


Anti-C33C 
(NS3) 
Antibo^EIA 
GMT 


Anti-ClOO 
(NS4) 
AntiboctyEIA 
GMT 


Anti-NS5 
Antibody EIA 
GMT 


PLG-NS345 


<10 


<10 


<10 


10 


PLG- 
NS345Core,2, 


<10 


<10 


<10 


15 


PLG- 
NS345+PLG- 
CTAB-E1E2 

DNA 


<10 


11 


<10 


23 


PLG- 
NS345Core,2, 
+E1E2 DNA 


<10 


<10 


<10 


20 


PLG-NS345 
+MF59 


<10 


<10 


<10 


70 


PLG- 
N.S345Coreij, 
+ MF59 


<10 


<10 


<10 


26 


PLG-NS345 + 
CTAB-CPG 


<10 


<10 


<10 


350 


PLG- 
NS345Core,j, 
+ CTAB-CPG 


<10 


<10 


<10 


271 


PLG-NS345 
+QS21 


<10 


<10 


<10 


201 


PLG- 
NS345Core,i, 
+ QS21 


<10 


<10 


<10 


505 
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PT O MQ^A^ 4- 

CTAB-CPG + 
MF59 




<10 


<10 


1471 




PLG- 


<10 


<10 


<10 


63 


5 


NS345Corei2i 












+ 












CTAB+MF59 











"Groups of 10 C57 black mice were immunized IM at 0, 30 and 60 days. Serum was 
1 0 obtained after the last immunization. The NS34S or NS345Corei2i protein concentration 
was 10 fig per dose. 
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Table 12. Immunogenicity of different fonnulatioiis of HCV NS345CoreB, or HCV 
NS345 in Mice. Geometric mean HA antibody titers to NS345Corei2i protein are shown. 


Vaccine" 


Anti-NS345Core,j, Antibody EIA GMT 


NS345Corem/MF59 


328 


NS345Ck)re,2i/MF59/CpG 


7,926 


NS3 4a+NS5B+Core/MF59 


12 


NS34a+NS5B+Core/MF59/CpG 


5 


PLG-NS345Core,2,/MF59 


<10 


PLG-NS345Core,2i/MF59/CpG 


<10 


PLG-NS345Core,„/PLG-CpG 


9 


NS34SCore,2i/alum phosphate 


34 


NS34SCorei2i/alum phosphate//CpG 


950 


NS345/MF59/CpG 


511 


PLG-NS345/PLG/CpG 


117 


NS345Core i2,/MF59/MPL 


■ 10,292 


NS345Core ,ji/MF59/Chol/QS21 


698 


NS345Core n/Alum phosphate/MPL 


23 



20 "Groups of 10 C57 black mice were immunized IM at 0, 30 and 60 days. Serum was 

obtained after the last immunization. The NS345 or NS345Core,2iprotein concentration was 
5 Jig per dose. 



25 
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EXAMPLE 15 

Lymphoproliferative Response of Different Formidattons ofNS345Corej2j or 
NS34A + NS35B + Core in Mice 

Groups of 8 C57 black mice were immimized IM at 0, 30 and 60 days with 
5 the fonnulatioiis shown in Table 13 and are as described above. Spleens were 

obtained after the last immunization. The NS345Corei2i protein concentration was 
25 ^ig per dose. The NS34a, NS5b and core doses were 3 |ag each. 

The results of this study are shown in Table 13. As can be seen, 
NS345Corei2i/Alum/CpG, PLG-NS345Core,2i/PLG/CpG, NS34a4-NS5B+ 
10 Core/MF59/CpG and PLG-NS345Corei2i/MF59/CpG formulations demonstrated 
strong LPA responses to NS5, NS34 and core antigens. The NS345Corei2i/MF59 
formulation also produced a strong LPA response to NS5 andNS34. Core was not 
tested. 

Moderate LPA responses were observed to NS5, NS34 and Core antigens with PLG- 
15 NS345Corei2i/MF59 and NS34a +NS5B + Ck>re/MF59 formulations. The 

NS345Corei2i/MF59/CpG formulation may not have been administered properly in 
that no LPA response was observed in this experiment. In a subsequent experiment 
as shown in Table 14, an LPA was observed to this formulation. 

Groups of 8 C57 black mice were immunized once IM with the formulations 
20 shown in Table 1 4, produced as described above, Draming lymph nodes were 
obtained. 

The NS345Corei2i protein concentration was 25 ^g per dose. 

The results of this study are shown in Table 14. As can be seen in Table 14, 
all the formulations tested produced a strong LPA response to NS5, NS34 and Core 
25 as well as the NS345Corei2i . 
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Table 13. Lymphoproliferative response of different formulations of HCVNS345Core,2i 


orNS34A+NS5B+Core in Mice. LPA responses (cpm) to core and nonstructural proteins 


are shown. The number of mice in each group responding is also indicated in. 




parentheses. 














HIV-2 env 






bOD-C200 


SOD-C22-3 


(background 


Vaccine* 






(Core) 


control) 


NS345Core|2i/MF59 


2250 


1800 


ND 


144 












ND345Corei2i/MF59/CpG 


80 


on 

80 


ND 


138 


PLG-NS345Corei2i/MF59 


«f\ 
jW 


1 OA 

IZO 


510 


93 








(2/8) 




PLG- 


1600 


1500 


620 


75 


NS345Core,2,/MF59/CpG 


(6/8) 


(6/8) 


(8/8) 




NS34a+NS5B+Core/MF59 


564 


710 


265 


76 




(8/8) 


(8/8) 


(8/8) 




NS34a+NS5B+ 


1523 


885 


446 


67 


Core/MF59/CpQ 


(8/8) 


(8/8) 


(6/8) 




PLG- 


3675 


2860 


370 


88 


NS345Core„,/PLG/CpG 


(8/8) 


(8/8) 


(8/8) 




NS345Core,2,/Almn/CpG 


8450 


7940 


1040 


82 




(8/8) 


(8/8) 


(6/8) 





"^Groups of 8 C57 black mice were immunized IM at 0, 30 and 60 days. Spleens were 
obtained after the last immunization. TheNS345Corei2, protein concentration was 25 jig 
per dose. The NS34a, NS5B and Core doses were 3 |ig each. 

25 
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EXAMPLE 16 

Immunogenicity of Recombinant HCV Protein Vaccines Adjuvanted with 
ISCOMS in Rhesus Macaques 

The safety and immunogenicily of HCV proteins completed with the 
5 adjuvant, Iscomatrix, was studied in Rhesus macaques. Three groups made up of 
four animals each were hnmunized IM as detailed below at week 0, 4 and 8 weeks. 
Vaccines were prepared as described above. The ISCOMS used lacked QH-A. 



Qtoup 
Number 


n 


Vaccine 


Delivoy 


1 


4 


Core-ISCOM (50 |ig in 1 ml) 


0.5 ml R Leg 
0.5 ml L Leg 


2 


4 


NS345Core,2,-ISCOM (1 mg in 1 
ml) 


0.5 ml R Leg 
0.5 ml L Leg 


3 


4 


Core-ISCOM (25 ^g in 0.5 ml) 
and NS5b-ISC0M (50 ^g in 0.35 
ml) 


0.5 ml Core-ISCOM R Leg 
0.35 ml NS5b-ISC0M L Leg 



15 Bleeds occurred as follows and immunogenicity was determined by CTL assays, 
lymphoproliferation assays, FACS analysis and antibody response a previously 
described (Palakos, et al. (2001) J. oflmmunoloev 166:35891. 
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Week 


Bleed date 


Immunized 


-10 






-1 


X 




0 




X 


2 


X 




4 




X 


6 


X 




8 




X 


10 


X 





The immunogenicity of the different HCV recombmant protein vaccines is 
shown in Tables 15-17. ' 
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Table 15. The Immunogenicity of HCV Core-ISCOMS vaccine two weeks post 
immunization and post 3" immunization as assessed by CTL assays, CD8+ FACS 
anafysis, LPA stimulation index and CD4+ FACS analysis 




2 weeks post 3° 


Macaqu 
e# 


CD8+ICS(CTL) 


CD4+ICS(LPASI) 


C 


NS 
3 


NS4 


NS5a 


NS5b 


C 


NS3 


NS4 


NS5a 


NS5b 


X020 


-(-) 










+(-) 










NOOl 


^(-) 










+(-) 










N086 


-(-) 










+(-) 










XOlO 


-(-) 










+(11) 

































15 





2 weeks post 2° 


Macaqu 
e# 


CD8+ ICS (CTL) 


CD4+ ICS (LPA SI) 


C 


NS 
3 


NS4 


NS5a 


NS5b 


C 


NS3 


NS4 


NS5a 


NS5b 


X020 


-(-) 










+/-(-) 










NOOl 


-(-) 










-(8) 










N086 


-(-) 










-K-) 










XOlO 


-(-) 










+/- 

(12) 
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Table 16. The Immunogenicity of HCV NS345Corem-ISCOMS vaccine two wedcs post 
2^ immunization and post 3"* immunization as assessed by CTL assays, CD8+ FACS 
analysis, LFA stimulation index and CD4+ FACS analysis 




2 weeks post 3 ° 


Macaqu 
e# 


CD8+ICS(CrL) 


CD4+ICS(LPASI) 


c 


NS3 


NS4 


NSSa 


NS5b 


C 


NS3 


NS4 


NSSa 


NS5b 


X016 


-(-) 


+(+) 


<-) 


-(-) 


<-) 


<-) 


+(-) 


<-) 


+K-) 


+/-(-) 


X008 


-(-) 


-(-) 


-(-) 


<-) 


-(-) 


-(-) 


-(-) 


<-) 


-(5) 


-(-) 


X021 


-(-) 


-(-) 


-(-) 


<-) 


-(-) 


<-) 


-(-) 


<-) 


<-) 


-(-) 


X023 


+/-(-) 


-(-) 


<-) 


+K-) 


-(-) 


<-) 


+/-(- 
) 


-(-) 


-(-) 


-(-) 





2 weeks post 2° 


Macaqu 
e# 


CD8+ICS(CTL) 


CD4+ICS(LPASI) 


C 


NS3 


NS4 


NSSa 


NSSb 


C 


NS3 


NS4 


NSSa 


NSSb 


X016 


-(-) 


+(+) 


<-) 


+(+) 


+(+) 


<-) 


■K-) 


+/-(-) 


+(-) 


■K-) 


X008 


+ (-) 


+(+) 


-(-) 


•K+) 


+(+) 


-(-) 


+(-) 


-(-) 


+-(-) 


+(-) 


X021 


-(-) 


-(-) 


+/-(-) 


-(-) 


+/-(-) 


-(-) 


-(-) 


+/-(-) 


<-) 


-(-) 


X023 


+/- 
(+) 


-K+) 


-(-) 


+/-(-) 


Mr) 


<-) 


+(-) 


<-) 


<-) 


7 
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Table 17. The Immunogenicity of HCV Core-ISCOMS + NS5b-ISC0MS vaccine two 
weeks post 2"* immunization and post 3"" immunization as assessed by CTL assays, CD8+ 
FACS analysis, LPA stimulation index and CD4+ FACS analysis 




2 weeks post 3° 


Macaqu 
e# 


CD8+ICS(CTL) 


CD4+ICS(LPASI) 


C 


NS 

3 


NS4 


NS5a 


NS5b 


C 


NS3 


NS4 


NS5a 


NS5b 


X022 


+(-) 








-0 


-(8) 










X014 


-(-) 








-(-) 


-K6) 








-Kll) 


N154 


-(-) 








-(-) 


-(-) 








+(-) 


N173 


-(-) 








<-) 


-(-) 








+/-(-) 

























15 





2 weeks post 2° 


Macaqu 
e# 


CD8+ICS(CTL) 


CD4+ICS(LPASI) 


C 


NS 

3 


NS4 


NS5a 


NS5b 


C 


NS3 


NS4 


NS5a 


NS5b 


X022 


-(-) 








-(+) 


-(-) 








-(-) 


X014 


•K-) 








+/-(-) 


-(-) 








+(.) 


N154 


-(-) 








-(-) 


"(6) 








+(8) 


N173 


-(-) 








-(-) 


+/-(-) 








+(6) 

























25 As can be seen in Table 15, the HCV Core-ISCOM vaccine produced no CTL 

positive responses in any of the 4 immunized macaques after the second or third 
• immunizations. No positive CDS y-interferon and/or TNF-a intracellular staining 
was also observed, although backgrounds were high in these particular arrays. At 
least two of four macaques produced a strong LPA response after the second 

30 immunizations, but only one remained positive after tihie third immunization. Two of 
four macaques produced positive CD4 intracellular staining after fhe second 
immunization and four of four after the third immunization. 
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As shown in Table 16, the HCV NS345Core,2i-ISCOM vaccine after the 
second immunization produced CTL positive responses to peptide pools representing 
two or more HCV proteins in three of four macaques (two of these macaques had 
responses to peptide pools from NS3, NS5a and NS5b, one to peptide pools from 
5 core and NS3). CDS positive y-interferon and/or TNF-a intraceUular staining to 

peptide pools representing two or more HCV proteins was positive in at least three of 
four macaques. One of four macaques produced a strong LPA response. At least 
three of four macaques produced CD4 positive intracellular staming to two or more 
HCV proteins. After the third immunization, only one of foxu: macaques had a 

10 positive CTL response, CDS positive intracellular staining and C04 positive 

intracellular staining. One other macaque had a positive LPA response and weak 
CD8+ CD4 intracellular staining. This decline in immunogenicity was likely due to 
instability of the vaccine formulation (see below). / 
As shown in Table 17, the HCV Core-ISCOM + NS5b-ISC0M vaccine 

1 5 produced a CTL positive response to NS5b in one of the 4 immunized macaques 
after the second immunization which did not remain positive after the third 
immunizaton. CDS positive intracellular positive staining was observed in one of 
four animals post second. Two of four macaques produced a strong LPA response 
after the second immunization which did not remain positive after the third 

20 immunization. Two otlier macaques did develop a strong LPA response after the 
third immunization. Three or four developed positive CD4 intracellular staining. 
One developed positive CDS intracellular staining. 

Three weeks after the third immunization, it was noted that the physical 
appearance of the polyprotein vaccine solution was visibly turbid. The core vaccine 

25 also was turbid but less so. The Core-NS5 vaccine was also sUghtly turbid. Analysis 
of this turbidity in the polyprotein formulation indicated that the ISCOM particles 
had precipitated into large aggregates. These aggregates could be dispersed by 
vortexing with 0.1% TWEEN SO detergent. It is probable that this change in the 
formxilation of the vaccine occurred before the last inmiunization. This observed 

30 change in appearance of the vaccines may have affected their immunogencity as 
cellular immune results declmed in all three vaccines. 
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The immunogenicity of HCV Core-ISCOMS, NS345Core,2rISCOMS and 
Core-ISCOMS + NS5b-ISC0MS as assessed by EIA antibody response is shown in 
Table 18, As can be seen, all three vaccines produced an antibody response by the 
third immunization to their corresponding HCV proteins, except for the 
5 NS345Corei2i-ISCOM vaccine. The NS345Corei2i-ISCOM vaccine produced 

antibody responses to NS3, NS4 and a very strong antibody response to NS5, but no 
antibody response to HCV core. 
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Table 18. The immimogenicity of HCV Core-ISCOMS, NS34SCorem-ISCOMS, Core- 
ISOCMS +NS5b-ISC0MS vaccine two weeks post 2** immunization and post 3"* 
inmiunization as assessed by EIA antibody response to HCV proteins. 


Vaccine 
Macaque 
ft 


Anti-Core EIA 
Antibody Titer 


Anti-NS3 EIA 
. Antibody Titer 


Anti-NS4EIA 
Antibody Tita: 


Anti-NS5 EIA 

Antibody Titer 


2™' 




JTOSl 

2™" 




rOSt 
2nd 


rOSt J 


FOSt 

2"" 


Post 3"* 


ISCOM 


















X020 


66 


226 














NOOl 


87 


46 














N086 


363 


396 














XOlO 


108 


137 

































NS345 
Corel21/ 
ISCOM 


















X016 


<10 


<10 


<10 


554 


56 


68 


3,590 


3,405 


X008 


<10 


<10 


66 


995 


14 


44 


2,109 


3,213 


X021 


<10 


' <10 


128 


6,330 


41 


204 


7,213 


8,083 


X023 


<10 


<10 


<10 


3,910 . 


64 


64 


1,243 


4,704 




















Core- 
ISCOM 

+NS5b- 
ISCOM 


















X022 


<10 


18 










<10 


134 


X014 


<10 


13 










<10 


693 


N154 


542 


554 










<10 


272 


N173 


28 


78 










<10 


258 
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EXAMPLE 17 

Immunization of Chimpanzees with Recombinant HCV Protein and DNA 
Vaccines 

Five groups offive chimps each were inmiimizedlM at 0, 0.7, 2 and 5 

months 

with the formulations presented below. Blood was collected at week 0, two weeks 
subsequent to the second immunization, two weeks following the third immunization 
and two weeks after the fourth immunization. 

Formulation 1: 20 jig E1E2 polypeptide + MF59 + 500 ^ig CpG (produced as 
described above); 

Formulation 2: 1 mg NS345Core,2,-ISCOM (produced as described above); 

Formulation 3: 6 mg each of CTAB-PLG-E1E2 (bp 574-2427, encoding 
amino acids 192-809 of the HCV polyprotein, numbered relative to HCV-1); CTAB- 
PLG-NS34a (bp 3079-5133, encoding amino acids 1027-171 1 of liie HCV 
polyprotein, numbered relative to HCV-1); CTAB-PLG-NS34ab (bp 4972-5916, 
encoding amino acids 1658-1972 of the HCV polyprotein, numbered relative to 
HCV-1); CTAB-PLG-NS5a (bp 5917-7260, encoding amino acids 1973-2420 of tiie 
HCV polyprotein, numbered relative to HCV-1); 

Formulation 4: 6 mg each of E1E2 DNA, NS34a DNA, NS34ab DNA and 
NS5a DNA, having the same coordinates as described above, delivered without PLG 
viaelectropomtion(see, e,g.,U.S. Patent Nos. 6,132,419; 6,451,002, 6,418,341, 
6233,483, U.S. Patent Publication No. 2002/0146831; and International Publication 
No. WO/0045823, for this deliveiy technique). Results are shown in Figures 8-10. 

As can be seen, in Figure 8, all vaccines were capable of priming CD4+ and 
CD8+ cells specific to HCV. Thus, all vaccines were successful at inducing a T cell 
response to HCV. Determination of the results for the PLG-DNA from formulation 3 
at two weeks subsequent to the fourth vaccination is in progress. 

As shown in Figures 9 and 10, multiple T cell specificities were induced by 
the two vaccines. Both vaccines primed T-cells specific for multiple T cell epitopes. 

As can be seen m Tables 19 and 20, E1E2 adjuvanted with MF59 primed 
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anti-ElE2 titers. CpG enhanced anti-ElE2 responses as well as THl responses and 
the ISCOM and the two DNA vaccines were capable of priming CD4+ and CD8+ T 
cell responses to HCV, 
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Thus, HCV polypeptides and polynucleotides, eith^ alone or as fusions, to 
stimulate cell-mediated immune responses, are disclosed. Although preferred 
embodiments of the subject invention have been described in some detail, it is 
understood diat obvious variations can be made without departing from the spirit and 
5 the scope of the invention as defined by the appended claims. 



88 



wo 2004/039950 



PCTAJS2003/033610 



We claim : 

1 . A fusion protein comprising HCV polypeptides, wherein the HC V 
polypeptides consist essentially of anNS3, anNS4, anNS5 and a core polypeptide of 

5 a hepatitis C virus (HCV), wherein said core polypeptide consists of amino acids 1 - 
121 of the HCV polyprotein, numbered relative to the full-length HCV-1 polyprotein. 

2. The fusion protein of claim 1 , wherein the NS5 polypeptide is an 
NS5a 

10 polypeptide. 

3. The fusion protein of claim 1, wherein the NS5 polypeptide is an 
NS5b 

polypeptide. 

15 

4. The fusion protein of claim 1 , wherein the NS5 polypeptide is an 
NS5a 

and an NS5b polypeptide. 

20 6. The fusion protein of claim 4, wherein the protein comprises the 

sequence 
of amino acids of SEQ ID N0:8. 

7. A fusion protein according to any of claims 1-6, wherein at least one 
25 of the HCV polypeptides is derived from a different strain of HCV than the other 

HCV polypeptides. 

8. A composition comprising: 

(a) a fusion protein according to any of claims 1-7; and 
30 (b) a pharmaceutically acceptable excipient. 
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5 



9. The composition of claim 8, further comprising an adjuvant, 

1 0. The composition of claim 8, further comprising a CpG 
oligonucleotide. 

1 1 . The composition of claim 8, wherein said fusion protein is adsorbed to 
or 

entrapped within a microparticle or ISCOM. 



10 12. The composition of claim 8, further comprising a polynucleotide 

encoding an HCV E1E2 complex. 

13. An isolated and purified polynucleotide that encodes a fusion protein 
according to any of claims 1-7. 

15 

' 14. A composition comprising: 

(a) the isolated and purified polynucleotide of claim 13; and 

(b) aphannaceutically acceptable excipient. 

20 15. The composition of claim 14, further comprising an adjuvant. 

16. The composition of claim 14, wherein said polynucleotide is adsorbed 
to 

or entrapped within a microparticle. 

25 

1 7. The composition of claim 14, further comprising a polynucleotide 
encoding an HCV E1E2 complex. 

18. A method of activating T cells of a vertebrate subject which recognize 
30 an epitope of an HCV polypeptide, comprising the step of: 

administering the composition of any of claims 8-12 to said vertebrate 
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subject, whereby a population of activated T cells recognizes an epitope of the NS3, 
NS4, NS5 and/or core polypeptides. 

1 9. A method of activating T cells of a vertebrate subject which recognize 
5 an epitope of an HC V polypeptide, comprising the step of: 

administering the composition of any of claims 14-17 to said vertebrate 
subject, whereby a population of activated T cells recognizes an epitope of the NS3, 
NS4, NS5 and/or core polypeptides, 

10 20. The method of claim 19 , wherein the polynucleotide is administered 

via 

electroporation. 

21 . Use of a composition according to any of claims 8-12 and 14-17 for 
15 activating T cells of a vertebrate subject which recognize an epitope of an HCV 

polypeptide, wherein a population of activated T cells recognizes an epitope of the 
NS3, NS4, NS5 and/or core polypeptides. 

22. Use of a fusion protein according to any of claims 1-7 for the 

20 manu&ctute of a medicament for activating T cells of a vertebrate subject vMch 
recognize an epitope of an HCV polypeptide, wherein a population of activated T 
cells recognizes an epitope of theNS3, NS4, NS5 and/or core polypeptides. 



25 



23. Use of a polynucleotide according to claim 13 for the manufacture of 
a 

medicament for activating T cells of a vertebrate subject which recognize an epitope 
of an HCV polypeptide, wherein a population of activated T cells recognizes an 
epitope of the NS3, NS4, NS5 and/or core polypeptides. 



30 
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1 10 

AW GC3G CCC ATC ACQ OOO TAC OCC CAG GAG 

.20 

T RO L hoc I I T S Ii TGR. 
ACA AGG <K3C CTC CTA OGG TQC AOJA ATC ACC AGC CTTA ACT GGC COG 

^0 40 

bkwqvbgbvqivsta 
■gac aaa aac caa- gtg qaq got gag gtc cag att gtg tca act .gct 

SO 

AQ TFL ATCIHGVCR T 
GOC CAA ACC TTC CTG GCA ACG TGC ATC AAT GGG GTG TGC TOG ACT 

60 70 

VYHGAGTRTIASPKG 
GTC TAC CAC GCJG GCC GGA ACG AGG ACC ATC GCG TCA CCC AAG GGT 

80 

^VlQMYTNVDQpiiVG 
OCT GTC ATC CAG ATG TAT ACC AAT GTA GAC CAA GAC CTT GTQ GGC 

100 

WPAPQQSRSliTP CT C 
TGG CCC GCT CCG C^A GOT AGC CGA TCA TTG ACA CCC TGC ACT TGC 

110 

O 8 SDIiY iiVTRHADVI 
GGC TCC TOG SAC CTT TAC CTG GTC ACG AGG CAC GCC GAT GTC ATT 

120 130 
PVRRRG DSRGeiiLSP 
CCC GTG CGC CGG CGG GGT GAT AGC AGG GGC AGC CTG CTG TCG CCC 

140 • 

RPlfiYLKGfifiGGPLL 
CGG- CCC ATT TCC TAC TTG AAA GGC TCC TCQ GGG GGT CCG CTG TTG 

"0 160 
CPAGHAV G IFRAAVC 
TGC CCC GCG GGG CAC GCC GTG GGC ATA TTT AGG GCC GCG GTG TGC . 

170 

T RGVAKAVD F I P V B H 
ACC CGT GGA GTG GCT AAG GCG GTG GAC TTT ATC CCT GTG GAG AAC 

180 

I* E T T M R S 
CTA GAG ACA ACC ATG AGG TCC 
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MATURE Bl 

SerPheSerllePheLeuLeuJaaLeuLeuSerCysLeXiThrValPrQiaaSerAlaTyx 192 

TCTTTCrCTATCTTCCTTCTGacCCrocriCTCn^ 

A0AiUlGAGATAGi^GQAAGACC6GGAOGA0AGAACGAAeiX3^ 



<31nValArg[AeiiSerThrGiyLeuTyrHisValThrAsnAspCVsP^ 2 12 

(2AAGTGCGCAACTCCAa3GGGCTCTACX:ACX3TCA(^^ 

GTTCaiaaCGrroAGGTGCCCCGAGATGGTGCAGTXW 

ValTyrGluAlaAlaAspAlallelieiiHisThrProGlyCysValProC^ 232 

GTGTACGAGGOSGCCGATGaaiTCCnXSGACACTCCXK^ 

CACATGCTCaK:a3GCTAlCKMTAGaACX3^ 

GlyAsiaAlaSerArgCyBTrpValAlaMetThrProThiValM 252 

GaCAACXKXnXXaaGTGTTOaGT^ 

CCGTTGCX3GAGCTCaia^CX:CACOTCTAC^^ 

IieuProAlaThrGlnLeuArgArgHielleAspLexiLeuValGlySerAlaThrljeuC^ 272 

CTCCCCGCGACXSCAGCTrCGAOSTCACATC^ 

GAGGGGCGCTGCGTCGAAGCTGOlGTGTAGCTMACGa^ 

SerAlaLeuiyi^V^alGlyAspLeuCVsGlySerValPheLeuValGlyGlnLeuPheT^ 2 92 

TCGGCCCTXJTACOTOGGGGACCTGTQCXSGGl^^ 

AiGCCGG3AGATGGACCX!CCFGGACACGCCC^^ 

PheSerProArgArgHisTrpThrThrGlnGlyCVaAsnCysSerlleTyrPrpGly^ 3 12 

TrCTCTCCa^GGCGCCACTGGACGAOGGAAGGTTGCAATTGCT 

AAGAGA(303T<XK3CGGTaACCTOClXK:GTTCC^^ 

IleThrQlyHlsArgMetAlaTrpAepMetMetMetJ^nTrpSerProThrt 332 

ATAACGGGTGACCGCSlTGGCATGGGATATCATGATGAAe^ 

TATroCCGAGTGGCGTACCGTACCCTATACTACTACTTGACCAGGGGATO 

ValMetAIaGlnLeuI^eUArglleProGlnAlalleLeiiAspMetlleAlaGlyAlaH 352 - 

<3EAa!rGQCTCAGCTOCTCa3GATCX2CAC^^ 

CATTACCmGTCGACGAGGCCmGGGTQTrCXS^^ 

TrpGlyValLeuAlaGlylleAlaTyrPheSerMetValGlyAenTrpAlaLysVallieu 372 ' 

TGGGGAGTCCrGGCXSGGaiTAGaSTATTTCK^ 

ACCCCTKaiGGACaSCCCaTATCXSCMAAi^ 

B2 

ValValLexiLeuLeuPheAlaGlyValAepAlaGluThrHieValThrGlyGlySerAla 392 
GTAGTGCTGCTGCTATOTGCCGGCGTCGACXSCGGAAACCCACW^ 

CyiTCaiC3GACaACaATAAACGG(X!QCAGCnXK!GCCm ' 

GlyHiBThrValSerOlyPheValSerLeuLeuAlaProGlyAlaLysGlnAsnValQln 412 

GGCOlCyiCTGTGTCTGGATTTGTTAGCrCTCCTasa^CCyvGGCGCa^ 

CCQGTGTGACAOAGACCTAAACaVATCQaAGaAGCGTGGTCaKJGGTTCaTCT 
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LeuIleAsnThrAsnGlySerTrpHisLeuAsnSerThrMalieuAsnCysAen^ 43 2 
GACrAaTTGTG(mXKX!GaX3UU!CGTaaAaTTAaX^ 



LeuAsnThrGlyTrpl^euAlaGlyLeuPheTyrHisHleLyePheAsxiSerSerGlyCV^ 452 

CTC^CauXGGCTGGTTQGCAaGGCTTTTCTA 

GAGTTGTGGTCGACa^CCXSTCamAAAGATAG 

ProGlimrgLeuAlaSerCYsArgProLeuThrAspPheAspGlnGlyTrpGlyProIle 472 

CCTGAGAGGCTAGCCMei^CGACX:CCTTACCX3ATl^^ 

QGACTCrrCCmTCGGT0GAC!GaCK3^^ 

SerTyrAlaAenGlySerGlyProAspGlnArgProTyrCysT^ 4 92 

AGTTA1XK:C3UU2GGMGCX3GCCCCGACCA^ 

TaUlTACGGTTGCXJTTOMaSGGGC^^ 

ProCysGlylleValProAlaliysSerValCysGlyProValTyrCyePheThrProSer 512 

ccttgcx3gtaxtgtgcx!cga3aagagtgtgtotggtc^ 

ggaacx3ocataacacxxkk:xk:ttctc^^ 

ProValValValGlyThrTtoAspArgSerGlyAlaProThrlVrSerT^ 532 

CCOGTGGTGGTGGGAACXaACCGACAGGTCGGGaSCGCCCACGTAC^ 

GGGCaCCACCACCCTTaCTGGCTGT^^ 

AspThrAspValPheValljeuAstiAsnThrArgProPTOLeuaiyAsnTrpPheGlyCys 552 
GATACGGACGTCTTTOTCCmSUiqUlTACCM^ 

CTATGCCTGCAGAAGCAGGAATTGTmT^ 

ThrTrpMetAsnSerThrGlyPheThrLyflValCysQlyAlaProProCyeVallleGly 572 

ACCTGGArGAACTCAACTGGATTaiCCAAAGTGTGCGGAG<^ 

TGGACCTACTTGAGTTGATCTAAGTGGTTTCACaiaSC 

GlyAlaGlyAsnAsnThrLeuHlsCVsProThrABpCyBPheArgLysHlBProAApAla 592 
GGGGCGGGCaU^CAAOlCCCTGCAC^^ 

aXICGCCXXmxSTTGTGGGAOGTOACX^ 

ThrTyrSorArgCVflGlySerGlyProTrpIleThrProArgCyeLeuValAspl^ 612 

ACATACTCTCGGTGCGGCTCCX3GTCCCTlSGATa^ 

TGTATaA0AGC<3l(:XK:OGAGGCC3WK3^ 

TyrArgLeuTrpHl sTyrProCyellirlleAsnTyrThrllePheLys IleArgMetTyr 632 

TATAGGCTTTGGCyiTTATCCrTGTACCATCAACTACACTATATTT^^ 

ATATCOGAAACCGTAATAGGAACATGGTAGTTGATGTGATATAAATTTTAGTCCT 

ValGlyGlyValGluHisArgLeuGluAlaAlaCyeAsnTrpThrArgGlyGluArgCys 65 2 

GTGGGAQGGOTCGAGCaiCAGaCTOGAAaCTGCCTO^ 

CACCCTCX:CCAGCTCaTGTCXJaACCTTOGA^^ 

AspLeuGliiAspArgAspArgSerGlxxLeuSerProLeiiLeuLeuThrThr^rGln 672 

GATCTGGAAGATAGGGACAGGTCCGAGCTCMCCraTTACrroC^^ 

CTAGACX^rTCIATCCCrGTCXJAGGCTC 
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OVOaTCCTCCeaTGTTCCTTCACAACCCTGCCy^ 
GTCCAGGAG<3OCACyVAGGAA0TGTTGGGAa3GlX:XK3A^ 

HiBGlnfifinlleValAspValGlnTyrLeuTyrQlyVaiGlySerSerlleAlaSerTrp 712 

CACCAQ]ViLCATTGTQGAC0TGCAGTACTTGTACG^ 

OTGGTCTTGTA&CAa^TGOlCGTCSlTQAAGM^^ 

Alal leLysTrpGluTyrValValLeuLeuPheLexiLeuIieuAlaAspAlaArgValCys 732 

GCaiTTAAGTGGGAGTACGTCGTCCTCCTGTTCCTTCTGCTT^ 

aSOTAATTCACCCTGATGCAGGAGGAGGACAAG 

P7 

SerCysIieuTrpMetMetLetLLeuIleSerGlnAlaGluAlaAlaLeuGliiAsxiL 1S2 

TCX^TGCTTOTGGATGATXKTTACTCyiTATCCCAAGC^^ 

AGQACGAAGACCTACTACGATGAGTATAGGGTOXK^ 

IleLeuAsnAlaAlaSerLeuAldLGlyThrHlBGlyLeuValSerPheLeuValPbePhe 7*72 
ATACXXAATGCAGCATCCCTOGCOSGGACGCAC^ . 
TATGAATTACGTCGTAGGGACCGGCCCTGCGT^^ 

CysPheAlaTrpTyrLeuLyeGlyLysTrpValProGlyAlaValTyrThrPhe^^ 792 

TOCTTTOaVTGGTATCTGAAGGGTAAGTGGGTGCCCGGAGCGGTC 

ACmAACGTACCa^TAGACTTCCCATTaicCCACGGG^ 

MetTrpPiToLeuLeiilieiiLeiiLetiLeiiAlalieuProGlxiArgAlaTyrAlaTC 809 
ATOTOTCCTCrCCTCePGCT^^ 

TAGACa3GaU}AGOAGQACG2VGQACSUlCGGGAAC^^ . 



FIG. 3C 



wo 2004/039950 



PCT/US2003/033610 



6/21 




SV40polyA 
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MAAYAAQGYK 
ATG GCT GCA TAT GCA GCT GAG 6GC TAT AAG 

20 

v lvlnpsv aa^'tlgfg 

GTG CTA GTA CTC AAC CCC TCT GTT GCT GCA ACA CTG GGC TTT GGT 

30 40 
AYMSKAHGIDPNIRT 
GCT TAG ATG TCC AAG. GCT CAT GGG ATG GAT GGT AAG ATG AGG ACC 



50 
P 



S 



GGG GTG AGA ACA ATT ACC ACT GGC AGG CCC ATG AGG TAG TCC ACC 



K 



60 
L 



70 
Y 



TAG GGC AAG TTC GTT GGC GAG GGC GGG TGC TCG GGG .GGC GCT TAT 

80 

DI I ICDEG H STOA TS 
GAG ATA ATA ATT TGT GAG GAG TGC CAC TCG AGG GAT GGC ACA TCC 

• 90 100 
ILGIGTVLDQAETAG 
ATG TTG GGG ATT GGG ACT GTG GTT GAG CAA GCA GAG ACT GGG GGG 

110 

ARLVV LATATPPGSV 
GGG AGA CTG GTT GTG CTG GGC ACC GGC ACC GCT GGG GGC TCC ' GTG 

120 130 
TVPHPN IEEVALSTT 
ACT GTG GGG CAT GGC AAG ATG GAG GAG GTT GCT CTG TGC ACC ACC 

140 

GEIPFYG^KAIPLEVI 
GGA GAG ATG GCT. TTT TAG GGC AAG GCT ATG CCC GTG GAA GTA ATG 

150 160 
KGGR HLIFCHSKKKC 
AAG GGG GGG AGA CAT CTC ATG TTG TGT GAT TGA AAG AAG AAG TGC 



170 

DE L AAKLVALG i'NAV" 
GAC GAA CTC GGC- GCA AAG CTG GTG GCA TTG GGC ATG AAT GGC GTG 

180 190 
AYYRGLDVSVIPTSG 
GGC TAG TAG GGG GGT GTT GAC GTG TCC GTG ATC CCG ACC AGG GGC 
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200 

DVVVVATDALMTGYT 
GAT GTT GTC GTC GTG GCA ACC GAT GCC CTC ATG ACC GGC TAT ACC 

210 220 
GDFD SVIDCNTCVTQ 
GGC GAC TTC GAC TCG GTG ATA GAC TGC AAT ACG TGT GTC ACC CAG 

230 

TVDFSLD PTFTIETI 
ACA GTC GAT TTC AGC CTT GAC CCT ACC TTC ACC ATT GAG ACA ATC 

- 240 250 

TLPQ DAVSRTQRRGR 
ACG CTC CCC CAA GAT GCT GTC TCC CGC ACT CAA CGT CGG GGC AGG 

260 

TGRGKPGIYRFVAPG 
ACT GGC AGG GGG AAG CCA GGC ATC TAC AGA TTT GTG GCA CCG GGG 

270 280 
ER. PSGMFDSSVLCEC 
GAG CGC CCC TCC GGC ATG TTC GAC TCG TCC GTC CTC TGT GAG TGC 

290 

VDAGCAWYELTPAET 
TAT GAC GCA GGC TGT GCT TGG TAT GAG CTC ACG CCC GCC GAG ACT 

300 310 
TVRL RAYMN TPGLPV 
ACA GTT AGG CTA CGA GCG TAC ATG.AAC ACC CCG GGG CTT CCC GTG 

320 

CQDHL EFWEGVFTGL 
TGC CAG GAC CAT CTT GAA TTT TGG GAG GGC GTC TTT ACA GGC CTC 

330 340 
T HIDAHFLSQTKQSG 
ACT CAT ATA GAT GCC CAC TTT CTA TCC CAG ACA AAG CAG AGT GGG 

350 

ENLPYLVAYQATVCA 
GAG AAC CTT CCT TAC CTG GTA GCG TAC CAA GCC ACC GTG TGC GCT 

360 370 
RAQ. AP PPSWDQMWKC 
AGG GCT CAA GCC CCT CCC CCA TCG TGG GAC CAG ATG TGG AAG TGT 



380 

LIRLKPTLHGPTPLL 
TTG ATT CGC CTC AAG CCC ACC CTC CAT GGG CCA ACA CCC CTG CTA 
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390 
A 



N 



H 



400 

P 



TAG AGA CTG GGC GOT GTT GAG AAT GAA ATC ACC CTG ACG CAC CCA 

• 410 

VTKYIMTCMSADLEV 
GTC ACC AAA TAC ATC AtG ACA TGC ATG TCG GCC GAC CTG GAG GTC 

420 430- 
VTSTWVLVGG VLAAL 
GTC ACG AGC ACC TGG GTG CTC GTT GGC GGC GTC CTG GCT GCT TTG 

. 440 

'^A AYCLSTGCVVIVGR 
GCC GCG TAT TGC CTG TCA ACA GGC TGC GTG GTC ATA GTG GGC AGG 

450 460 
VVLSGKPAIIPDREV 
GTC GTC TTG TCC GGG AAG CCG GCA ATC ATA CCT GAC AGG GAA GTC 

470 

LYREFDEMEECSQHL 
CTC TAC CGA GAG TTC GAT GAG ATG GAA GAG TGC- TCT CAG CAC TTA 

480 490 
.PYI EQGMMLAEQ FKQ. 
CCG TAC ATC GAG CAA GGG ATG ATG CTC GCC GAG CAG TTC AAG CAG 

500 

KALGLLQTASRQAEV 
AAG GCC CTC GGC CTC CTG CAG ACC GCG TCC CGT CAG GGA GAG GTT 

510 520 
lAPAVQTNWQKLETF 
ATC GCC CCT GCT GTC CAG ACC AAC TGG CAA AAA CTC GAG ACC TTC 

530 

WAKHMW NFISGIQYL 
TGG GCG AAG CAT ATG TGG AAC TTC ATC AGT GGG ATA CAA TAC TTG 

540 550 
AGLSTLPGNPAIAS L 
GCG GGC TTG TCA ACG GTG GCT GGT AAC GCC GGC ATT GCT TCA T^TG 

560 

MAFTAAVTSPLTTSQ 
ATG GCT TTT ACA GCT GCT GTC ACC AGC CCA CT^ ACC ACT AGQ CAA 

570 580 
TLLFNILGGWVAAQL 
ACC CTC CTC TTC AAC ATA TTG GGG GGG TGG GTG GCT. GCC CAG CTC 
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590 

AAPGAATAFVGAGLA 
GCC GCC CCC GOT GCC GCT ACT GCC TTT GTG GGC GCT GGC TTA GOT 

600 610 
GAAIGSVGLGKVLID 
GGC GCC GCC ATC GGC AGT GTT GGA CTG GGG AAG GTC CTC ATA GAC 

620 

I LAGYG AGVAGALVA 
ATC CTT GCA GGG TAT GGC GCG GGC GTG GCG GGA GCT CTT GTG GCA 

. 630 640 

FKIMSGEVPSTEDLV 
TTC AAG ATC ATG AGC GGT GAG GTC CCC TCC ACG GAG GAC CTG GTC 

650 

N L L P A I L S P GALV V G 
AAT CTA CTG CCC GCC ATC CTC TCG CCC GGA GCC CTC GTA GTC GGC 

660 670 

VVCAAILRRHVGPGE 
GTG GTC TGT GCA GCA ATA CTG CGC CGG CAC GTT GGC CCG GGC GAG 

680 

GAVQWMNRLIAFASR 
GGG GCA GTG CAG TGG ATG AAC CGG CTG ATA GCC TTC GCC TCC CGG 

690 . 700 

gnhvspthyvp'esda 
ggg aac cat gtt tcc ccc acg cac tac gtg ccg gag agc gat gca 

710 

AARVTAILSSLTVTQ 
GCT GCC CGC GTC ACT GCC ATA CTC AGC AGC CTC ACT GTA ACC CAG 

' 720 730 
LLRRLH QWISSE CTT 
CTC CTG AGG CGA CTG. CAC CAG TGG ATA AGC TCG GAG TGT ACC ACT 

740 

•PCSGSWLRDIWDWIC 
CCA TGC TCC GGT TCC TGG CTA AGG GAC ATC TGG GAC TGG ATA TGC 

750 760 
EVLSDFKTWLKAKLM 
GAG GTG TTG AGC GAC TTT AAG ACC TGG CTA AAA GCT AAG CTC ATG 

770 

•PQLPGIP.FVSCQRGY 
CCA CAG CTG CCT GGG ATC CCC TTT GTG TCC TGC CAG CGC GGG TAT 
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'780 790 
K.GVWRGDGIMHTRCH 
AAG GGG GTC TGG CGA GGG GAC GGC ATC ATG CAC ACT CGC TGC CAC 

800 

CGAEITGHVKNGTMR- 
TGT GGA GCT GAG ATC ACT GGA CAT GTC AAA AAC GGG ACG ATG AGG 

810 820 
IVGPRTCR NMWSGTF 
ATC GTC GGT CCT AGG ACC TGC AGG AAC ATG TGG AGT GGG ACC TTC 

830 

PINAYTT GPCTPLPA 
CCC ATT AAT GCC TAC ACC ACG GGC CCC TGT ACC CCC CTT CCT GCG 

840 850 
PNYTFALWRVSAEEY 
CCG AAC TAC ACG TTC GCG CTA TGG AGG GTG TCT GCA GAG GAA TAC 

860 

VEI RQVGDFHYVTGM 
GTG GAG ATA AGG CAG GTG GGG GAC TTC CAC TAC GTG ACG GGT ATG 

870 880 
.TT DNLKCPCQVPS PE 
ACT ACT GAC AAT CTT AAA TGC CCG TGC CAG GTC CCA TCG CCC GAA* 

890 

FFTELDGVRLHRFAP 
TTT TTC ACA GAA TTG GAC GGG GTG CGC CTA CAT AGG TTT GCG CCC 

900 910 
PCKPLLREEVSFRVG 
CCC TGC AAG CCC TTG CTG CGG GAG GAG GTA TCA TTC AGA GTA GGA 

920 

LHEYPVGSQLPCEPE 
CTC CAC GAA TAC CCG GTA GGG TCG CAA TTA CCT TGC GAG CCC GAA 

930 940 
PDVAVLTSMLTDPSH 
CCG GAC GTG GCC GTG TTG ACG TCC ATG CTC ACT GAT CCC TCC CAT 

950 

ITAEAA GRRLARGSP 
ATA ACA GCA GAG GCG GCC GGG CGA AGG TTG GCG AGG GGA TCA CCC 



960 970 
PSVASSSASQLSAPS 
CCC TCT GTG GCC AGC TCC TCG GCT AGC CAG CTA TCC GCT CCA !PCT 
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980 

LKAT .CTANHDSPDAE* 
CTC AAG GCA ACT TGC ACC GCT AAC CAT GAC TCC CCT GAT GCT GAG 

990 1000 
LI.EANLLWRQEMGGN 
CTC ATA GAG GCC AAC CTC CTA TGG AGG CAG GAG ATG GGC GGC AAC 

1010 

ITRVESENKVVILDS 
ATC ACC AGG GTT GAG TCA GAA AAC AAA GTG GTG ATT CTG GAC TCC 

1020 1030 
F DPLVAEEDEREISV 
TTC GAT CCG CTT GTG GCG GAG GAG jSAC GAG CGG GAG ATC TCC GTA 

1040 

PAE I LRKSRRFAQAL 
CCC GCA GAA ATC . CTG CGG AAG TCT CGG AGA TTC GCC CAG GCC CTG 

1050 1060 
PVWARPDYNPPLVB. T 
CCC GTT TGG GCG CGG CCG GAC TAT AAC CCC CCG CTA GTG GAG ACG 

1070 

WKKPDYEPPVVHGCP 
TGG AAA AAG CCC GAC TAG GAA CCA CCT GTG GTC CAT GGC TGC CCG 

1080 1090 
LPPPKSPPVPPPRKK 
CTT CCA CCT CCA AAG TCC CCT CCT GTG CCT CCG CCT CGG. AAG AAG 

1100 ' 

RTVVLTESTLSTALA 
CGG ACG GTG GTC CTC ACT GAA TCA ACC CTA TCT ACT GCC TTG GCC 

1110 1120 
ELATRS. FGSSSTS'GI 
GAG CTC GCC ACC AGA AGC TTT GGC AGC TCC TCA ACT TCC GGC ATT 

1130 

TGDNTTTSSEPA PSG 
ACG GGC GAC .AAT ACG ACA ACA TCC TCT GAG CCC GCC CCT TCT GGC 

1140 1150 
CPPDSDAESYSSMPP 
TGC CCC CCC GAC TCC GAC GCT GAG TCC TAT TCC TCC ATG CCC CCC 

1160 

LEGEPGDPDLSDGSW 
CTG GAG GGG GAG CCT GGG GAT CCG GAT CTT AGC GAC GGG TCA TGG 
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1170 1180 
STVSSEANAEDVVCC 
TCA ACG GTC AGT AGT GAG GCC AAC GCG GAG GAT GTC GTG TGC TGC 

1190 

SMSYSWTGALVTPCA 
TCA ATG TCT TAG TCT TGG ACA GGC GCA CTC GTC ACC CCG TGC GCC ' 

1200 1210 
ABEQKLPINALSNSL 
GCG GAA G^A CAG AAA CTG CCC ATC AAT GCA CTA AGC AAC TCG TTG 

1220 

LRHHNLVYSTTSRSA 
CTA CGT CAC CAC AAT TTG GTG TAT TCC ACC ACC TCA CGC AGT GCT 

1230 1240 
CQRQ KKVT.F DRLQVL 
TGC CAA AGG CAG AAG AAA GTC ACA TTT GAC AGA CTG CAA GTT CTG 

1250 

DSHYQDVLKEVKAAA 
GAC AGC CAT TAC CAG GAC GTA CTC AAG GAG GTT AAA GCA GCG GCG 

1260 . 1270 

SKVKANLLSVEEACS 
TCA AAA GTG AAG GCT AAC TTG CTA TCC GTA GAG GAA GCT TGC AGC 

1280 

LTPPHSAK'SKFGYGA 
CTG ACG CCC CCA CAC TCA GCC AAA TCC AAG TTT GGT TAT GGG GCA 

1290 1300 
KDVRCHARKAVTH IN 
AAA GAC GTC CGT TGC CAT GCC AGA AAG GCC GTA ACC CAC ATC AAC 

1310 

SVWKDLLEDNVTPI D 
TCC GTG TGG AAA GAC CTT CTG GAA GAC AAT GTA ACA CCA ATA GAC' 

1320 1330 
T T I MAKNEVFCVQ.P E 
ACT ACC ATC ATG GCT AAG AAC GAG GTT TTC TGC GTT CAG CCT GAG 

1340 

KG6RKPARLIVFPDL 
AAG GGG GGT CGT AAG CCA GCT CGT CTC ATC GTG TTC CCC GAT CTG 

1350 1360 

GVRVCEKMALYDVVT 

GGC GTG CGC GTG TGC GAA AAG ATG GCT TTG TAC GAC GTG GTT ACA 

1 
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1370 

KLPLAVM.GSSYGFQY 
AAG CTC CCC TTG GCC GTG ATG GGA AGC TCC TAG GGA TTC CAA TAG 

1380 1390 
S PGQRVE. FLVQAWKS 
TCA CCA GGA CAG CGG GTT GAA TTC CTC GTG CAA GCG TGG AAG TCC 

1400 

KKTPMGFSYDTRCFD 
AAG AAA ACC CCA ATG GGG TTC TCG TAT GAT ACC CGC TGC TTT GAC 

1410 1420 
-STVTESDIRTEE'aiY 
TCC ACA GTC ACT GAG AGC GAC ATC CGT ACG GAG GAG GCA ATC TAC 

1430 

Q C C DL D PQARVA. I K s" 
CAA. TGT TGT GAC CTC GAC CGC CAA GCC CGC GTG GCC ATC AAG TCC 

1440 1450 
LTERLYVGGPLTNSR 
CTC ACC GAG AGG CTT TAT GTT GGG GGC CCT CTT ACC AAT TCA AGG 

1460 

GENCGYRRCRASGVL 
GGG GAG AAC TGC GGC TAT CGC AGG TGC CGC GCG AGC GGC GTA CTG 

1470 . 1480 

TTSCGNTLTCYIKAR 
ACA ACT AGC TGT GGT AAC ACC PTC ACT TGC TAC ATC AAG GCC CGG 

1490 

AAC RAAGLQDCTMLV 
GCA GCC TGT CGA GCC GCA GGG CTC CAG GAC TGC ACC ATG CTC GTG 

1500 1510 
CGDDLVVICESAGVQ 
TGT GGC GAC GAC TTA GTC GTT ATC TGT GAA AGC GCG GGG GTC CAG 

1520 

EDAAS L RAFTEAMT R 
GAG GAC GCG GCG AGC CTG AGA GCC TTC ACG GAG GCT ATG ACC AGG 

1530 1540 
Y SAPPGDPPQPEYDL 
TAC TCC GCC CCC CCT GGG GAC CCC CCA CAA CCA GAA TAC GAC TTG 



1550 

ELITSCSSNVSVAHD, 
GAG CTC ATA ACA TCA TGC TCC TCC AAC GTG TCA GTC GCC CAC GAC 
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1560 ^ 1570 

GAGKRVYYLT RDPTT. 
GGC GCT GGA AAG AGG GTC TAC TAG CTC ACC CGT GAC CCT ACA ACC 

1580 

PLARAAWETARHT PV- 
CCC CTC GCG AGA GCT GCG TGG GAG ACA GCA AGA CAC ACT CCA GTC 

1590 1600 
N SWLGNI IMFAPTLW 
AAT TCC TGG CTA GGC AAC ATA ATC ATG TTT GCC CCC ACA CTG TGG 

1610 

ARMILMTHFFSVLIA 
GCG AGG ATG ATA CTG ATG ACC CAT TTC TTT AGC GTC CTT ATA GCC 

1620 1630 
RDQLEQAliDCEIY GA 
AGG GAC CAG CTT GAA CAG GCC CTC GAT TGC GAG ATC TAC GGG GCC 

1640 

CYSIEPLDLPPIIQR 
TGC TAC TCC ATA GAA CCA CTG GAT CTA CCT CCA ATC ATT CAA AGA 

, 1650 1660 

L H G .L S A F S L H S Y S P G* 
CTC CAT GGC CTC AGC GCA TTT TCA CTC CAC AGT TAC TCT CCA GGT 

1670 

EINRVAACLRKLGVP 
GAA ATC AAT AGG GTG GCC GCA TGC CTC AGA AAA CTT GGG GTA CCG 

1680 1690 
PLRAWR^HRARSVRAR 
CCC TTG CGA GCT TGG AGA CAC CGG GCC CGG AGC GTC CGC GCT AGG 

1700 

LLAR.GGRAAICGKYL 
.CTT CTG GCC AGA GGA jSGC AGG GCT GCC ATA TGT GGC AAG TAC CTC 

1710 1720 
FNWAVRTKLK LTPIA 
TTC AAC TGG GCA GTA AGA ACA AAG CTC TVAA CTC ACT CCA ATA GCG 

1730 

AAGQLDLSGWFTAGY 
GCC GCT GGC CAG CTG GAC TTG TCC GGC TGG TTC ACG GCT GGC TAC 



1740 1750 
SGGDIYHSVSHARPR 
AGC GGG GGA GAC ATT TAT CAC AGC GTG TCT CAT GCC CGG CCC CGC 



F/G. fir 
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1760 

WIW FCLLLLAAG V6I 
TGG ATC TGG TTT TGC CTA CTC CTG CTT GCT GCA GGG GTA GGC ATC 

1770 1780 
YLIi.PNRMSTNPKPQR 
TAG CTC CTC CCC AAC CGA ATG AGC ACG AAT CCT AAA CCT CAA AGA 

1790 

KTKRNTNRRPQDVKF 
AAG ACC AAA CGT AAC.ACC 7VAC CGG CGG CCG CAG GAC GTC AAG TTC 

1800 1810 
PGGGQIVGGVYLLPR 
CCG GGT GGC GGT CAG ATC GTT GGT GGA.GTT TAC TTG TTG CCG CGC 

1820 

RGPRLGVRATRKTSE 
AGG GGC CCT AGA TTG GGT GTG CGC GCG ACG AGA AAG ACT TCC GAG 

1830 1840 
RSQPRGRRQPIPKAR 
CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT ATC CCC AAG GCT CGT 

1850 

RPEGRTWAQPGYPWP 
CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG TAC CCT TGG CCC 

1860 1870 
L'YG NE GCGWAGWLL S 
CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG CTC CTG TCT 

1880 

PRGSRPSW.GPTDPRR 
CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC CCC CGG CGT 

1890 1892 
R S R N L G K 
AGG TCG CGC AAT TTG GGT PiRG 
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SEQUENCE LISTING 

<110> HOUGHTON, Michael 
CQ2VTES, Steve 
SELBY, Mark 
PALIARD, Xavler 

<120> ACTIVATION OF HCV-SPECIPIC T-CELLS 

<130> 2300-1612,60, 

<150> 10/281,341 
<151> 2002-10-25 

<160> 10 

<170> PatentIn Ver. 2.0 

<210> 1 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; fusion protein 
epitope 

<400> 1 

His Glu Tyr Pro Val Gly Ser Gin Leu 
1 5 



<210> 2 
<2H> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: fusion protein 
epitope 

<400> 2 

Ala Glu Leu' He Glu Ala Asn Leu Leu Trp Arg Gin Glu Met Gly 
15 10 15 



<210> 3 
<211> 1914 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HCV-1 El/E2/p7 region 

<220> 

<221> CDS 

<222> (1) . . (1911) 

<400> 3 

tot ttc tct ate ttc ctt ctg gcc ctg etc tct tgc ttg act gtg ccc 48 
Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro 



1 
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15 10 15 

get teg gcc tac caa gtg cgc aac tec acg ggg etc tac cae gtc ace 96 
Ala Ser Ala Tyr Gin Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr 
20 25 30 

aat gat tgc ect aac teg agt att gtg tac gag gcg gcc gat gcc ate 144 
Asn Asp Cys Pro Asn Ser Ser lie Val Tyr Glu Ala Ala Asp Ala lie 
35 40 45 

ctg cae act ccg ggg tgc gtc ect tgc gtt cgc gag ggc aac gcc teg 192 
Leu His Thr Pro Gly Cys Val Pro Cys Val Arg Glu Gly Asn Ala Ser 
50 55 60 



agg tgt tgg gtg gcg atg acc cet acg gtg gcc acc agg gat ggc aaa 
Arg Cys Trp Val Ala Met Thr Pro Thr Val Ala Thr Arg Asp Gly Lys 
65 70 75 80 



acg caa ggt tgc aat tgc tet ate tat cec ggc eat ata acg ggt cae 
Thr Gin Gly Cys Asn Cys Ser lie Tyr Pro Gly His lie Thr Gly His 
130 135 140 

cgc atg gea tgg gat atg atg atg aac tgg tec ect acg acg gcg ttg 
Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu 
145 150 155 160 

gta atg get cag ctg etc egg ate cca caa gee ate ttg gae atg ate 
Val Met Ala Gin Leu Leu Arg lie Pro Gin Ala lie Leu Asp Met lie 
165 170 175 



240 



etc ccc gcg acg cag ctt cga cgt cae ate gat ctg ctt gtc ggg age 288 
Leu Pro Ala Thr Gin Leu Arg Arg His He Asp Leu Leu Val Gly Ser 
85 90 95 

gcc acc etc tgt teg gcc etc tac gtg ggg gae ctg tgc ggg tet gtc 336 
Ala Thr Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Ser Val 
100 105 110 

ttt ctt gtc ggc caa ctg ttt acc ttc tet ccc agg cgc cae tgg acg 384 
Phe Leu Val Gly Gin Leu Phe Thr Phe Ser Pro Arg Arg His Trp Thr 
115 120 125 



432 



480 



528 



get ggt get cae tgg gga gtc ctg gcg ggc ata gcg tat ttc tec atg 576 
Ala Gly Ala His Trp Gly Val Leu Ala Gly He Ala Tyr Phe Ser Met 
180 185 190 

gtg ggg aac tgg gcg aag gtc ctg gta gtg ctg ctg eta ttt gcc ggc 624 
Val Gly Asn Trp Ala Lys Val Leu Val Val Leu Leu Leu Phe Ala Gly 
195 200 205 

gtc gae gcg gaa acc cae gtc acc ggg gga agt gcc ggc cae act gtg 672 
Val Asp Ala Glu Thr His Val Thr Gly Gly Ser Ala Gly His Thr Val 
210 215 220 

tet gga ttt gtt age etc etc gea cca ggc gcc aag cag aac gtc cag 720 
Ser Gly Phe Val Ser Leu Leu Ala Pro Gly Ala Lys Gin Asn Val Gin 
225 230 235 240 

ctg ate aac ace aac ggc agt tgg cae etc aat age acg gee ctg aac 768 
Leu He Asn Thr Asn Gly Ser Trp His Leu Asn Ser Thr Ala Leu Asn 
245 250 255 
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tgc aat gat age etc aac acc ggc tgg ttg gca ggg ctt ttc tat cac 816 
Cys Asn Asp Ser Leu Asn Thr Gly Trp Leu Ala Gly Leu Phe Tyr His 
260 265 270 

cac aag ttc aac tct tea ggc tgt cct gag agg eta gee age tgc ega 864 
His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg 
275 280 285 

ccc ctt acc gat ttt gac cag' ggc tgg ggc cct ate agt tat gcc aac 912 
Pro Leu Thr Asp Phe Asp Gin Gly Trp Gly Pro He Ser Tyr Ala Asn 
290 295 300 

gga age ggc ccc gac cag cgc ccc tac tgc tgg cac tac ccc cea aaa 960 
Gly Ser Gly Pro Asp Gin Arg Pro Tyr Cys Trp His Tyr Pro Pro Lys 
305 310 315 320 

cct tgc ggt att gtg ccc gcg aag agt gtg tgt ggt ccg gta tat tgc 1008 
Pro Cys Gly He Val Pro Ala Lys Ser Val Cys Gly Pro Val Tyr Cys 
325 i 330 335 

ttc act ccc age ccc gtg gtg gtg gga acg acc gac agg teg ggc gcg 1056 
Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Ser Gly Ala 
340 345 35Q 

CCC acc tac age tgg ggt gaa aat gat acg gac gtc ttc gtc ctt aac 1104 
Pro Thr Tyr Ser Trp Gly Glu Asn Asp Thr Asp Val Phe Val Leu Asn 
355 360 365 

aat ace agg cea ccg ctg ggc aat tgg ttc ggt tgt acc tgg atg aac 1152 
Asn Thr Arg Pro Pro Leu Gly Asn Trp Phe Gly Cys Thr Trp Met Asn 
370 375 380 

tea act gga ttc acc aaa gtg tgc gga gcg cct cct tgt gtc ate gga 1200 
Ser Thr Gly Phe Thr Lys Val Cys Gly Ala Pro Pro Cys Val He Gly 
385 390 395 400 

ggg gcg ggc aac aac acc ctg cac tgc ccc act gat tgc ttc cgc aag 1248 
Gly Ala Gly Asn Asn Thr Leu His Cys Pro Thr Asp Cys Phe Arg Lys 
405 410 415 

cat ccg gac gcc aca tac tct egg tgc ggc tec ggt ccc tgg ate aca 1296 
His Pro Asp Ala Thr Tyr Ser Arg Cys Gly Ser Gly Pro Trp He Thr 
420 425 430 • 

ccc agg tgc ctg gtc gac tac ccg tat agg ctt tgg cat tat cct tgt 1344 
Pro Arg Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 
435 440 445 

acc ate aac tac act ata ttt aaa ate, agg atg tac gtg gga ggg gtc 1392 
Thr He Asn Tyr Thr He Phe Lys He Arg Met Tyr Val Gly Gly Val 

450 455 460 

gag cac agg ctg gaa get gcc tgc aac tgg acg egg ggc gaa egt tgc 1440 
Glu His Arg Leu Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys 
465 470 475 480 

gat ctg gaa gat agg gac agg tec gag etc age ccg tta ctg ctg ace 1488 
Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Thr 
485 490 495 

act aca cag tgg cag gtc etc ccg tgt tec ttc aca ace ctg cea gcc 1536 
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Thr Thr Gin Trp Gin Val Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala 
500 505 510 

ttg tec acc ggc etc ate cac etc cac cag aac att gtg gac gtg eag 1584 
Leu Ser Thr Gly Leu lie His Leu His Gin Asn lie Val Asp Val Gin 
515 520 525 

tac ttg tac ggg gtg ggg tea age ate gcg tec tgg gee att aag tgg 1632 
Tyr Leu Tyr Gly Val Gly Ser Ser He Ala Ser Trp Ala He Lys Trp 
530 535 540 

gag tae gte gte etc etg ttc ctt ctg ctt gca gae gcg cgc gtc tgc 1680 
Glu Tyr Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 
545 550 555 560 

tec tgc ttg tgg atg atg eta etc ata tee caa gcg gaa gcg get ttg 1728 
Ser Cys Leu Trp Met Met Leu Leu He Ser Gin Ala Glu Ala Ala Leu 
565 570 575 

gag aac etc gta ata ctt aat gca gca tec ctg gee ggg aeg cac ggt 1776 
Glu Asn Leu Val He Leu Asn Ala Ala Ser Leu Ala Gly Thr His Gly 
580 585 590 

Ctt gta tec ttc etc gtg ttc ttc tgc ttt gca tgg tat etg aag ggt 1824 
Leu Val Ser Phe Leu Val Phe Phe Cys Phe Ala Trp Tyr Leu Lys Gly 
595 600 605 

aag tgg gtg ccc gga gcg gtc tae ace ttc tac ggg atg tgg ect etc 1872 
Lys Trp Val Pro Gly Ala Val Tyr Thr Phe Tyr Gly Met Trp Pro Leu 
610 615 620 

etc ctg etc etg ttg gcg ttg ccc cag egg gcg tae gcg taa 1914 
Leu Leu Leu Leu Leu Ala Leu Pro Gin Arg Ala Tyr Ala 
625 630 635 



<210> 4 
<211> 637 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HCV-1 El/E2/p7 region 
<400> 4 

Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro 
15 10 15 

Ala Ser Ala Tyr Gin Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr 

20 25 30 

Asn Asp Cys Pro Asn Ser Ser He Val Tyr Glu Ala Ala Asp Ala He 
35 40 45 

Leu His Thr Pro Gly Cys Val Pro Cys Val Arg Glu Gly Asn Ala Ser 
50 55 60 

Arg Cys Trp Val Ala Met Thr Pro Thr Val Ala Thr Arg Asp Gly Lys 
65 70 75 80 

Leu Pro Ala Thr Gin Leu Arg Arg His He Asp Leu Leu Val Gly Ser 



4 



wo 2004/039950 



PCT/US2003/033610 



85 90 95 

Ala Thr Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Ser Val 
100 105 110 

Phe Leu Val Gly Gin Leu Phe Thr Phe Ser Pro Arg Arg His Trp Thr 
115 120 125 

Thr Gin Gly Cys Asn Cys Ser lie Tyr Pro Gly His lie Thr Gly His 
130 135 140 

Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu 
145 150 155 160 

Val Met Ala Gin Leu Leu Arg lie Pro Gin Ala lie Leu Asp Met He 
165 170 175 

Ala Gly Ala His Trp Gly Val Leu Ala Gly He Ala Tyr Phe Ser Met 
180 185 190 

Val Gly Asn Trp Ala Lys Val Leu Val Val Leu Leu Leu Phe Ala Gly 

195 200 205 

Val Asp Ala Glu Thr His Val Thr Gly Gly Ser Ala Gly His Thr Val 
210 215 220 

Ser Gly Phe Val Ser Leu Leu Ala Pro Gly Ala Lys Gin Asn Val Gin 
225 230 235 240 

Leu He Asn Thr Asn Gly Ser Trp His Leu Asn Ser Thr Ala Leu Asn 
245 250 255 

Cys Asn Asp Ser Leu Asn Thr Gly Trp Leu Ala Gly Leu Phe Tyr His 
260 265 270 

His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg 

275 280 285 

Pro Leu Thr Asp Phe Asp Gin Gly Trp Gly Pro He Ser Tyr Ala Asn 
290 295 300 

Gly Ser Gly Pro Asp Gin Arg Pro Tyr Cys Trp His Tyr Pro Pro Lys 
305 310 315 320 

Pro Cys Gly He Val Pro Ala Lys Ser Val Cys Gly Pro Val Tyr Cys 

325 330 335 

Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Ser Gly Ala 
340 345 350 

Pro Thr Tyr Ser Trp Gly Glu Asn Asp Thr Asp Val Phe Val Leu Asn 
355 360 365 

Asn Thr Arg Pro Pro Leu Gly Asn Trp Phe Gly Cys Thr Trp Met Asn 
370 375 380 

Ser Thr Gly Phe Thr Lys Val Cys Gly Ala Pro Pro Cys Val He Gly 
385 390 395 400 

Gly Ala Gly Asn Asn Thr Leu His Cys Pro Thr Asp Cys Phe Arg Lys 
405 410 415 



5 



wo 2004/039950 



PCTA;S2003/033610 



His Pro Asp Ala Thr Tyr Ser Arg Cys Gly Ser Gly Pro Trp He Thr 

420 425 430 

Pro Arg Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 
435 440 445 

Thr He Asn Tyr Thr He Phe Lys He Arg Met Tyr Val Gly Gly Val 
450 455 460 

Glu His Arg Leu Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys 
465 470 475 480 

Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Thr 
485 490 495 

Thr Thr Gin Trp Gin Val Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala 
500 505 510 

Leu Ser Thr Gly Leu He His Leu His Gin Asn He Val Asp Val Gin 

515 520 525 

Tyr Leu Tyr Gly Val Gly Ser Ser He Ala Ser Trp Ala He Lys Trp 
530 535 540 

Glu Tyr Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 
545 550 555 560 

Ser Cys Leu Trp Met Met Leu Leu He Ser Gin Ala Glu Ala Ala Leu 
565 570 575 

Glu Asn Leu Val He Leu Asn Ala Ala Ser Leu Ala Gly Thr His Gly 
580 585 590 

Leu Val Ser Phe Leu Val Phe Phe Cys Phe Ala Trp Tyr Leu Lys Gly 
595 600 605 

Lys Trp Val Pro Gly Ala Val Tyr Thr Phe Tyr Gly Met Trp Pro Leu 
610 615 620 



Leu Leu Leu Leu Leu Ala Leu Pro Gin Arg Ala Tyr Ala 
625 630 635 



<210> 5 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus sequence 
<400> 5 

Gly Ser Ala Ala Arg Thr Thr Ser Gly Phe Val Ser Leu Phe Ala Pro 
15 10 15 

Gly Ala Lys Gin Asn 
20 



<210> 6 
<211> 20 



6 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description o£ Artificial Sequence: exemplary CpG 
ol igonucleot ide 

<400> 6 

tccatgacgt tcctgacgtt 20 



<210> 7 
<211> 5676 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: representative 
NS345Core fusion protein 

<220> 

<221> CDS 

<222> (1) . . (5676) 

<400> 7 

atg get gca tat gca get cag ggc tat aag gtg eta gta etc aac ccc 48 
Met Ala Ala Tyr Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro 
15 10 15 

tct gtt got gca aca ctg ggc ttt ggt get tac atg tec aag get cat 96 
Ser Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His 
20 25 30 

ggg ate gat act aac ate agg ace ggg gtg aga aca att acc act ggc 144 
Gly lie Asp Pro Asn He Arg Thr Gly Val Arg Thr He Thr Thr Gly 
35 40 45 

age ccc ate acg tac tec acc tac ggc aag ttc ctt gee gac ggc ggg 192 

Ser Pro He Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly 

50 55 60 

tgc teg ggg ggc get tat gac ata ata att tgt gac gag tgc cae tec 240 

Cys Ser Gly Gly Ala Tyr Asp He He He Cys Asp Glu Cys His Ser 

65 70 75 80 

acg gat gcc aca tec ate ttg ggc att ggc act gtc ctt gac caa gca 288 
Thr Asp Ala Thr Ser He Leu Gly He Gly Thr Val Leu Asp Gin Ala 
85 90 95 

gag act gcg ggg gcg aga ctg gtt gtg etc gcc acc gcc acc ect ccg 336 
Glu Thr Ala Gly Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro 
100 105 110 

ggc tec gtc act gtg ccc cat ccc aac ate gag gag gtt get ctg tee 384 
Gly Ser Val Thr Val Pro His Pro Asn He Glu Glu Val Ala Leu Ser 
115 120 125 

acc acc gga gag ate ect ttt tac ggc aag get ate ccc etc gaa gta 432 
Thr Thr Gly Glu He Pro Phe Tyr Gly Lys Ala He Pro Leu Glu Val 
130 135 140 

ate aag ggg ggg aga cat etc ate ttc tgt cat tea aag aag aag tgc 480 



7 
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lie Lys Gly Gly Arg His Leu lie Phe Cys His Ser Lys Lys Lys Cys 
145 150 155 160 

gac gaa etc gcc gca aag ctg gtc gca ttg ggc ate aat gee gtg gee 528 
Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly lie Asn Ala Val Ala 
165 170 175 

tac tac cgc ggt ctt gac gtg tec gtc ate ccg ace age ggc gat gtt 576 
Tyr Tyr Arg Gly Leu Asp Val Ser Val lie Pro Thr Ser Gly Asp Val 
180 185 190 

gtc gtc gtg gca acc gat gcc etc atg acc ggc tat ace ggc gac ttc 624 
Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe 
195 200 205 

gac teg gtg ata gac tgc aat acg tgt gtc acc cag aca gtc gat ttc 672 
Asp Ser Val lie Asp Cys Asn Thr Cys Val Thr Gin Thr Val Asp Phe 
210 215 220 

age ctt gac cct acc ttc ace att gag aca ate acg etc ccc caa gat 720 
Ser Leu Asp Pro Thr Phe Thr lie Glu Thr lie Thr Leu Pro Gin Asp 
225 230 235 240 

get gtc tee cgc act caa egt egg ggc agg act ggc agg ggg aag cea 768 
Ala Val Ser Arg Thr Gin Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro 
245 250 255 

ggc ate tac aga ttt gtg gca ccg ggg gag cgc ccc tec ggc atg ttc 816 
Gly lie Tyr Arg Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met Phe 
260 265 270 

gac teg tec gtc etc tgt gag tgc tat gac gca ggc tgt get tgg tat 864 
Asp Ser Ser Val Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr 
275 280 285 

gag etc acg ccc gcc gag act aca gtt agg eta ega gcg tac atg aac 912 
Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met Asn 
290 295 300 

acc ccg ggg ctt ccc gtg tgc cag gac cat ctt gaa ttt tgg gag ggc 960 
Thr Pro Gly Leu Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Gly 
305 310 315 320 

gtc ttt aca ggc etc act eat ata gat gee cac ttt eta tec cag aca 1008 
Val Phe Thr Gly Leu Thr His lie Asp Ala His Phe Leu Ser Gin Thr 
325 330 335 

aag cag agt ggg gag aac ctt cct tac ctg gta gcg tac caa gee ace 1056 
Lys Gin Ser Gly Glu Asn Leu Pro Tyr Leu Val Ala Tyr Gin Ala Thr 
340 345 350 

gtg tgc get agg get caa gcc cct ccc cea teg tgg gac cag atg tgg 1104 
Val Cys Ala Arg Ala Gin Ala Pro Pro Pro Ser Trp Asp Gin Met Trp 
355 360 365 

aag tgt ttg att cgc etc aag eee acc etc cat ggg cea aca ccc ctg 1152 
Lys Cys Leu lie Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu 
370 375 380 

eta tac aga ctg ggc get gtt cag aat gaa ate acc ctg acg cac cea 1200 
Leu Tyr Arg Leu Gly Ala Val Gin Asn Glu lie Thr Leu Thr His Pro 
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385 390 395 400 

gtc acc aaa tac ate atg aca tgc atg teg gcc gac ctg gag gtc gtc 1248 
Val Thr Lys Tyr lie Met Thr Cys Met Ser Ala Asp Leu Glu Val Val 
405 410 415 

acg age acc tgg gtg etc gtt ggc ggc gtc ctg get get ttg gcc gcg 1296 
Thr Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala 
420 425 430 

tat tgc ctg tea aca ggc tgc gtg gtc ata gtg ggc agg gtc gtc ttg 1344 
Tyr Cys Leu Ser Thr Gly Cys Val Val He Val Gly Arg Val Val Leu 
435 440 445 

tec ggg aag ccg gca ate ata cet gac agg gaa gtc etc tac cga gag 1392 
Ser Gly Lys Pro Ala He He Pro Asp Arg Glu Val Leu Tyr Arg Glu 
450 455 460 

ttc gat gag atg gaa gag tgc tct cag cac tta ccg tac ate gag caa 1440 
Phe Asp Glu Met Glu Glu Cys Ser Gin His Leu Pro Tyr He Glu Gin 
465 470 475 480 

ggg atg atg etc gee gag cag ttc aag cag aag gcc etc ggc etc ctg 1488 
Gly Met Met Leu Ala Glu Gin Phe Lys Gin Lys Ala Leu Gly Leu Leu 
485 490 495 

cag acc" gcg tec cgt cag gca gag gtt ate gcc cet get gtc cag ace 1536 
Gin Thr Ala Ser Arg Gin Ala Glu Val He Ala Pro Ala Val Gin Thr 
500 505 510 

aac tgg caa aaa etc gag acc ttc tgg gcg aag cat atg tgg aac ttc 1584 
Asn Trp Gin Lys Leu Glu Thr Phe Trp Ala Lys His Met Trp Asn Phe 

515 520 525 

ate agt ggg ata caa tac ttg gcg ggc ttg tea acg ctg cet ggt aac 1632 
He Ser Gly He Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn 
530 535 540 

ecc gcc att get tea ttg atg get ttt aca get get gtc ace age cea 1680 
Pro Ala He Ala Ser Leu Met Ala Phe Thr Ala Ala Val Thr Ser Pro 
545 550 555 560 

eta acc act age caa acc etc etc ttc aac ata ttg ggg ggg tgg gtg 1728 
Leu Thr Thr Ser Gin Thr Leu Leu Phe Asn He Leu Gly Gly Trp Val 
'565 570 575 

get gcc cag etc gcc gee cec ggt gcc get act gcc ttt gtg ggc get 1776 
Ala Ala Gin Leu Ala Ala Pro Gly Ala Ala Thr Ala Phe Val Gly Ala 
580 585 590 

ggc tta get ggc gcc gcc ate ggc agt gtt gga ctg ggg aag gtc etc 1824 
Gly Leu Ala Gly Ala Ala He Gly Ser Val Gly Leu Gly Lys Val Leu 
595 600 605 

ata gac ate ctt gca ggg tat ggc gcg ggc gtg gcg gga get ctt gtg 1872 
He Asp He Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val 
610 615 620 

gca ttc aag ate atg age ggt gag gtc ecc tec acg gag gac ctg gtc 1920 
Ala Phe Lys He Met Ser Gly Glu Val Pro Ser Thr Glu Asp Leu Val 
625 630 635 640 
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aat eta ctg ccc gcc ate etc teg ccc gga gee cte gta gte gge gtg 1968 
Asn Leu Leu Pro Ala lie Leu Ser Pro Gly Ala Leu Val Val Gly Val 
645 650 655 

gtc tgt gca gca ata ctg cgc egg cac gtt ggc ccg ggc gag ggg gea 2016 
Val Cys Ala Ala He Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala 
660 665 670 

gtg cag tgg atg aac egg ctg ata gcc ttc gcc tec egg ggg aac cat 2064 
Val Gin Trp Met Asn Arg Leu He Ala Phe Ala Ser Arg Gly Asn His 
675 680 685 

gtt tec cec acg cac tac gtg ceg gag age gat gca get gee cgc gtc 2112 
Val Ser Pro Thr His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val 
690 695 700 

act gcc ata etc age age etc act gta ace cag etc ctg agg cga ctg 2160 
Thr Ala He Leu Ser Ser Leu Thr Val Thr Gin Leu Leu Arg Arg Leu 
705 710 715 720 

cac cag tgg ata age teg gag tgt ace act eca tge tec ggt tec tgg 2208 
His Gin Trp He Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp 
725 730 735 

eta agg gac ate tgg gac tgg ata tge gag gtg ttg age gae ttt aag 2256 
Leu Arg Asp He Trp Asp Trp He Cys Glu Val Leu Ser Asp Phe Lys 
740 745 750 

ace tgg eta aaa get aag etc atg eca cag ctg cct ggg ate ccc ttt 2304 
Thr Trp Leu Lys Ala Lys Leu Met Pro Gin Leu Pro Gly He Pro Phe 
755 760 765 

gtg tec tge cag cgc ggg tat aag ggg gtc tgg cga ggg gac gge ate 2352 
Val Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg Gly Asp Gly He 
770 775 780 

atg cac act cgc tge cac tgt gga get gag ate act gga eat gte aaa 2400 
Met His Thr Arg Cys His Cys Gly Ala Glu He Thr Gly His Val Lys 
785 790 795 800 

aac ggg acg atg agg ate gte ggt cct agg ace tge agg aac atg tgg 2448 
Asn Gly Thr Met Arg He Val Gly Pro Arg Thr Cys Arg Asn Met Trp 
805 810 815 

I 

agt ggg ace ttc ccc att aat gcc tac ace acg gge ccc tgt ace ccc 2496 
Ser Gly Thr Phe Pro He Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro 
820 825 830 

ctt cct gcg ccg aac tac acg ttc gcg eta tgg agg gtg tct gea gag 2544 
Leu Pro Ala Pro Asn Tyr Thr Phe Ala Leu Trp Arg Val Ser Ala Glu 
835 840 845 

gaa tac gtg gag ata agg cag gtg ggg gac ttc cac tac gtg acg ggt 2592 
Glu Tyr Val Glu He Arg Gin Val Gly Asp Phe His Tyr Val Thr Gly 
850 855 860 

atg act act gac aat ctt aaa tge ccg tge cag gtc eca teg ccc gaa 2640 
Met Thr Thr Asp Asn Leu Lys Cys Pro Cys Gin Val Pro Ser Pro Glu 
865 870 875 880 
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ttt ttc aca gaa ttg gac ggg gtg cgc eta cat agg ttt gcg ccc ccc 2688 

Phe Phe Thr Glu Leu Asp Gly Val Arg heu His Arg Phe Ala Pro Pro 

885 890 895 

tgc aag ccc ttg ctg egg gag gag gta tea ttc aga gta gga etc eac 2736 

Cys Lys Pro Leu Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu His 
900 905 910 



gaa tae eeg gta ggg teg caa tta cct tge gag ccc gaa eeg gac gtg 
Glu Tyr Pro Val Gly Ser Gin Leu Pro Cys Glu Pro Glu Pro Asp Val 
915 920 925 

gee gtg ttg aeg tec atg etc act gat eee tee eat ata aca gea gag 

Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His lie Thr Ala Glu 

930 935 940 

» 

geg gee ggg cga agg ttg gcg agg gga tea ccc ccc tet gtg gee age 

Ala Ala Gly Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser Val Ala Ser 

945 950 955 960 

tec teg get age eag eta tec get cea tet etc aag gca act tgc acc 
Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr 
965 970 975 

get aac eat gac tec cct gat get gag etc ata gag gee aac etc eta 
Ala Asn His Asp Ser Pro Asp Ala Glu Leu lie Glu Ala Asn Leu Leu 
980 985 990 

tgg agg cag gag atg ggc ggc aac ate acc agg gtt gag tea gaa aac 
Trp Arg Gin Glu Met Gly Gly Asn lie Thr Arg Val Glu Ser Glu Asn 
995 1000 . 1005 

aaa gtg gtg att ctg gac tee ttc gat ccg ctt gtg gcg gag gag gac 
Lys Val Val lie Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp 
1010 1015 1020 

gag egg gag ate tec gta ccc gca gaa ate ctg egg aag tet egg aga 
Glu Arg Glu lie Ser Val Pro Ala Glu He Leu Arg Lys Ser Arg Arg 
1025 1030 1035 1040 

ttc gee cag gcc ctg ccc gtt tgg gcg egg ccg gac tat aac eee ccg 
Phe Ala Gin Ala Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro 
1045 1050 1055 

eta gtg gag acg tgg aaa aag ccc gac tae gaa cea cct gtg gtc cat 
Leu Val Glu Thr Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val His 
1060 1065 1070 

ggc tge eeg ett cea cct cea aag tec cct cct gtg cct eeg cct egg 
Gly Cys Pro Leu Pro Pro Pro Lys Ser Pro Pro Val Pro Pro Pro Arg 
1075 1080 1085 

aag aag egg acg gtg gtc etc act gaa tea ace eta tet act gee ttg 
Lys Lys Arg Thr Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala Leu 
1090 1095 1100 

gcc gag etc gcc acc aga age ttt ggc age tec tea act tec ggc att 
Ala Glu Leu Ala Thr Arg Ser Phe Gly Ser Ser Ser Thr Ser Gly He 
1105 1110 1115 1120 

acg ggc gac aat aeg aca aca tee tet gag ccc gee cct tet ggc tge 



2784 



2832 



2880 



2928 



2976 



3024 



3072 



3120 



3168 



3216 



3264 



3312 



3360 



3408 
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Thr Gly Asp Asn Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys 
1125 1130 1135 

ccc ccc gac tec gac get gag tec tat tee tec atg cce eee etg gag 3456 
Pro Pro Asp Ser Asp Ala Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu 
1140 1145 1150 

ggg gag cot ggg gat ccg gat ctt age gac ggg tea tgg tea aeg gte 3504 
Gly Glu pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val 
1155 1160 1165 

agt agt gag gcc aac gcg gag gat gtc gtg tgc tgc tea atg tct tac 3552 
Ser Ser Glu Ala Asn Ala Glu Asp Val Val Cys Cys Ser Met Ser Tyr 
1170 1175 1180 

tct tgg aca ggc gea etc gtc acc ccg tgc gcc gcg gaa gaa eag aaa 3600 
Ser Trp Thr Gly Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gin Lys 
1185 1190 1195 1200 

etg eee ate aat gea eta age aac teg ttg eta egt eac cac aat ttg 3648 
Leu Pro He Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Leu 
1205 1210 1215 

gtg tat tee ace ace tea ege agt get tgc caa agg eag aag aaa gte 3696 
Val Tyr Ser Thr Thr Ser Arg Ser Ala Cys Gin Arg Gin Lys Lys Val 
1220 1225 1230 

aca ttt gac aga etg caa gtt etg gac age cat tac cag gac gta etc 3744 
Thr Phe Asp Arg Leu Gin Val Leu Asp Ser His Tyr Gin Asp Val Leu 
1235 1240 1245 



aag gag gtt aaa gea geg geg tea aaa gtg aag get aac ttg eta tec 
Lys Glu Val Lys Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu Ser 
1250 1255 1260 



cct gag aag ggg ggt egt aag cca get egt etc ate gtg tte ccc gat 
Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu He Val Phe Pro Asp 
1330 1335 1340 

etg gge gtg ege gtg tgc gaa aag atg get ttg tac gac gtg gtt aca 
Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Thr 
1345 1350 1355 1360 

aag etc eee ttg gee gtg atg gga age tee tac gga tte eaa tac tea 
Lys Leu Pro Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gin Tyr Ser 



3792 



gta gag gaa get* tgc age etg aeg eee cca eac tea gee aaa tee aag 3840 
Val Glu Glu Ala Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser Lys 
1265 1270 1275 1280 

ttt ggt tat ggg gea aaa gac gte egt tgc cat gee aga aag gee gta 3888 
Phe Gly Tyr Gly Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala Val 
1285 1290 1295 

ace eac ate aac tee gtg tgg aaa gac ctt etg gaa gac aat gta aca 3936 
Thr His He Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val Thr 
1300 1305 1310 

cca ata gae act ace ate atg get aag aac gag gtt tte tgc gtt cag 3984 
Pro He Asp Thr Thr He Met Ala Lys Asn Glu Val Phe Cys Val Gin 
1315 1320 1325 



4032 



4080 



4128 
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1365 



1370 



1375 



cca gga cag egg gtt gaa ttc etc gtg eaa gcg tgg aag tec aag aaa 
Pro Gly Gin Arg Val Glu Phe Leu Val Gin Ala Trp Lys Ser Lys Lys 
1380 1385 1390 

acc cca atg ggg ttc teg tat gat acc cgc tgc ttt gac tec aca gtc 
Thr Pro Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val 
1395 1400 1405 



4176 



4224 



act gag age gac ate cgt acg gag gag gca ate tac caa tgt tgt gac 
Thr Glu Ser Asp He Arg Thr Glu Glu Ala He Tyr Gin Cys Cys Asp 
1410 1415 1420 

etc gac ecc eaa gee cgc gtg gee ate aag tec etc ace gag agg ett 
Leu Asp Pro Gin Ala Arg Val Ala He Lys Ser Leu Thr Glu Arg Leu 
1425 1430 1435 1440 

tat gtt ggg ggc cct ctt acc aat tea agg ggg gag aac tgc ggc tat 
Tyr Val Gly Gly Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr 
1445 1450 1455 

cgc agg tgc cgc gcg age ggc gta ctg aca act age tgt ggt aac acc 
Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr 
1460 1465 1470 

etc act tgc tac ate aag gee egg gca gee tgt ega gee gca ggg etc 
Leu Thr Cys Tyr He Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu 
1475 1480 1485 

cag gac tgc acc atg etc gtg tgt ggc gac gac tta gtc gtt ate tgt 
Gin Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val Val He Cys 
1490 1495 1500 

gaa age gcg ggg gtc cag gag gac gcg gcg age ctg aga gee ttc acg 
Glu Ser Ala Gly Val Gin Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr 
1505 1510 1515 1520 

gag get atg acc agg tac tec gee cec cct ggg gac ecc cca eaa cca 
Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gin Pro 
1525 1530 1535 

gaa tac gac ttg gag etc ata aca tea tgc tec tec aac gtg tea gtc 
Glu Tyr Asp Leu Glu Leu He Thr Ser Cys Ser Ser Asn Val Ser Val 
1540 1545 1550 

gee cac gac ggc get gga aag agg gtc tac tac etc acc cgt gac cct 
Ala His Asp Gly Ala Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro 
1555 1560 1565 

aca ace ecc etc gcg aga get gcg tgg gag aca gca aga cac act cca 
Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro 
1570 1575 1580 

gtc aat tec tgg eta ggc aac ata ate atg ttt gee ecc aca ctg tgg 
Val Asn Ser Trp Leu Gly Asn He He Met Phe Ala Pro Thr Leu Trp 
1585 1590 1595 1600 

gcg agg atg ata ctg atg acc cat ttc ttt age gtc ctt ata gee agg 
Ala Arg Met He Leu Met Thr His Phe Phe Ser Val Leu He Ala Arg 
1605 1610 1615 



4272 



4320 



4368 



4416 



4464 



4512 



4560 



4608 



4656 



4704 



4752 



4800 



4848 
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gac cag ctt gaa cag gcc etc gat tgc gag ate tac ggg gcc tgc tac 4896 
Asp Gin Leu Glu Gin Ala Leu Asp Cys Glu He Tyr Gly Ala Cys Tyr 
1620 1625 1630 

tec ata gaa cca ctg gat eta cet cca ate att caa aga etc cat ggc 4944 
Ser He Glu Pro Leu Asp Leu Pro Pro He He Gin Arg Leu His Gly . 
1635 1640 1645 

etc age gca ttt tea etc cac agt tac tct cca ggt gaa ate aat agg 4992 
Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro Gly Glu He Asn Arg 
1650 1655 1660 



gtg gcc gca tgc etc aga aaa ctt ggg gta ecg eee ttg ega get tgg 
Val Ala Ala Cys Leu Arg Lys Leu Gly Val Pro Pro Leu Arg Ala Trp 
1665 1670 1675 1680 

aga cac egg gee egg age gtc cgc get agg ctt ctg gcc aga gga ggc 
Arg His Arg Ala Arg Ser Val Arg Ala Arg Leu Leu Ala Arg Gly Gly 
1685 1690 1695 



tet cat gee egg eee cgc tgg ate tgg ttt tgc eta etc ctg ctt get 
Ser His Ala Arg Pro Arg Trp He Trp Phe Cys Leu Leu Leu Leu Ala 
1750 1755 1760 



1745 



gag egg teg caa cct ega ggt aga cgt cag cet ate ccc aag get cgt 
Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro He Pro Lys Ala Arg 
1825 1830 1835 1840 



5040 



5088 



agg get gcc ata tgt ggc aag tac etc ttc aac tgg gca gta aga aca 5136 
Arg Ala Ala He Cys Gly Lys Tyr Leu Phe Asn Trp Ala Val Arg Thr 
1700 1705 1710 

aag etc aaa etc act cca ata gcg gcc get ggc cag ctg gac ttg tee 5184 
Lys Leu Lys Leu Thr Pro He Ala Ala Ala Gly Gin Leu Asp Leu Ser 
1715 1720 1725 

ggc tgg ttc aeg get ggc tac age ggg gga gac att tat cac age gtg 5232 
Gly Trp Phe Thr Ala Gly Tyr Ser Gly Gly Asp He Tyr His Ser Val 
1730 1735 1740 



5280 



5328 



gca ggg gta ggc ate tac etc etc ccc aac ega atg age acg aat cet 
Ala Gly Val Gly He Tyr Leu Leu Pro Asn Arg Met Ser Thr Asn Pro 
1765 1770 1775 

aaa cct caa aga aag acc aaa cgt aac acc aac egg egg ecg cag gac 5376 
Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Gin Asp 
1780 1785 1790 

gtc aag ttc ecg ggt ggc ggt cag ate gtt ggt gga gtt tac ttg ttg 5424 
Val Lys Phe Pro Gly Gly Gly Gin He Val Gly Gly Val Tyr Leu Leu 
1795 1800 1805 

ecg cgc agg ggc cct aga ttg ggt gtg cgc gcg acg aga aag act tee 5472 
Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys Thr Ser 
1810 1815 1820 



5520 



egg ccc gag ggc agg acc tgg get cag ccc ggg tac cet tgg ccc etc 5568 
Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro Gly Tyr Pro Trp Pro Leu 
1845 1850 1855 
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tat ggc aat gag ggc tgc ggg tgg gcg gga tgg etc ctg tct ccc cgt 5616 
Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg 
1860 1865 1870 

ggc tct egg cct age tgg ggc ccc aca gac ccc egg cgt agg teg cgc 5664 
Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg Ser Arg 
1875 1880 1885 

aat ttg ggt aag 5676 
Asn Leu Gly Lys 
. 1890 



<210> 8 
<211> 1892 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: representative 
NS345Core fusion protein 

<400> 8 

Met Ala Ala Tyr Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro 
15 10 15 

Ser Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His 
20 25 30 

Gly He Asp Pro Asn He Arg Thr Gly Val Arg Thr He Thr Thr Gly 
35 40 45 

Ser Pro He Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly 
50 55 60 

Cys Ser Gly Gly Ala Tyr Asp He He He Cys Asp Glu Cys His Ser 

65 70 75 80 

Thr Asp Ala Thr Ser He Leu Gly He Gly Thr Val Leu Asp Gin Ala 
85 90 95 

Glu Thr Ala Gly Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro 
100 105 110 

Gly Ser Val Thr Val Pro His Pro Asn He Glu Glu Val Ala Leu Ser 
lis 120 125 

Thr Thr Gly Glu He Pro Phe Tyr Gly Lys Ala He Pro Leu Glu Val 
130 135 140 

He Lys Gly Gly Arg His Leu He Phe Cys His Ser Lys Lys Lys Cys 

145 150 155 160 

Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly He Asn Ala Val Ala 
165 170 175 

Tyr Tyr Arg Gly Leu Asp Val Ser Val He Pro Thr Ser Gly Asp Val 
180 185 190 

Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe 
195 200 205 
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Asp Ser Val lie Asp Cys Asn Thr Cys Val Thr Gin Thr Val Asp Phe 
210 215 220 

Ser Leu Asp Pro Thr Phe Thr lie Glu Thr lie Thr Leu Pro Gin Asp 
225 230 235 240 

Ala Val Ser Arg Thr Gin Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro 
245 250 255 

Gly lie Tyr Arg Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met Phe 
260 265 270 

Asp Ser Ser Val Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr 
275 280 285 

Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met Asn 
290 295 300 

Thr Pro Gly Leu Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Gly 
305 310 315 . 320 

Val Phe Thr Gly Leu Thr His He Asp Ala His Phe Leu Ser Gin Thr 

325 330 335 

Lys Gin Ser Gly Glu Asn Leu Pro Tyr Leu Val Ala Tyr Gin Ala Thr 
340 345 350 

Val Cys Ala Arg Ala Gin Ala Pro Pro Pro Ser Trp Asp Gin Met Trp 
355 360 365 

Lys Cys Leu He Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu 
370 375 380 

Leu Tyr Arg Leu Gly Ala Val Gin Asn Glu He Thr Leu Thr His Pro 
385 390 395 400 

Val Thr Lys Tyr He Met Thr Cys Met Ser Ala Asp Leu Glu Val Val 
405 410 415 

Thr Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala 
420 425 430 

Tyr Cys Leu Ser Thr Gly Cys Val Val He Val Gly Arg Val Val Leu 
435 440 445 

Ser Gly Lys Pro Ala He He Pro Asp Arg Glu Val Leu Tyr Arg Glu 
450 455 460 

Phe Asp Glu Met Glu Glu Cys Ser Gin His Leu Pro Tyr He Glu Gin 
465 470 475 480 

Gly Met Met Leu Ala Glu Gin Phe Lys Gin Lys Ala Leu Gly Leu Leu 
485 490 495 

Gin Thr Ala Ser Arg Gin Ala Glu Val He Ala Pro Ala Val Gin Thr 
500 505 510 



Asn Trp Gin Lys Leu Glu Thr Phe Trp Ala Lys His Met Trp Asn Phe 
515 520 525 



16 



wo 2004/039950 



PCT/US2003/033610 



lie Ser Gly lie Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn 
530 535 540 

Pro Ala He Ala Ser Leu Met Ala Phe Thr Ala Ala Val Thr Ser Pro 
545 550 555 560 

Leu Thr Thr Ser Gin Thr Leu Leu Phe Asn He Leu Gly Gly Trp Val 
565 570 575 

Ala Ala Gin Leu Ala Ala Pro Gly Ala Ala Thr Ala Phe Val Gly Ala 

580 585 590 

Gly Leu Ala Gly Ala Ala He Gly Ser Val Gly Leu Gly Lys Val Leu 
595 600 605 

He Asp He Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val 
610 615 620 

Ala Phe Lys ile Met Ser Gly Glu Val Pro Ser Thr Glu Asp Leu Val 
625 630 635 640 

Asn Leu Leu Pro Ala He Leu Ser Pro Gly Ala Leu Val Val Gly Val 
645 650 655 

Val Cys Ala Ala Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala 
660 665 670 

Val Gin Trp Met Asn Arg Leu He Ala Phe Ala Ser Arg Gly Asn His 
675 680 685 

Val Ser Pro Thr His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val 
690 695 700 

Thr Ala Ile Leu Ser Ser Leu Thr Val Thr Gin Leu Leu Arg Arg Leu 

705 710 715 720 

His Gin Trp He Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp 
725 730 735 

Leu Arg Asp He Trp Asp Trp Ile Cys Glu Val Leu Ser Asp Phe Lys 
740 745 750 

Thr Trp Leu Lys Ala Lys Leu Met Pro Gin Leu Pro Gly Ile Pro Phe 

755 760 765 

Val Ser Cys Gin Arg Gly Tyr Lys Gly Val Trp Arg Gly Asp Gly lie 
770 775 780 

Met His Thr Arg Cys His Cys Gly Ala Glu Ile Thr Gly His Val Lys 
785 790 795 800 

Asn Gly Thr Met Arg He Val Gly Pro Arg Thr Cys Arg Asn Met Trp 
805 810 815 

Ser Gly Thr Phe Pro Ile Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro 
820 825 830 



Leu Pro Ala Pro Asn Tyr Thr Phe Ala Leu Trp Arg Val Ser Ala Glu 
835 840 845 

Glu Tyr Val Glu lie Arg Gin Val Gly Asp Phe His Tyr Val Thr Gly 
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850 855 860 

Met Thr Thr Asp Asn Leu Lys Cys Pro Cys Gin Val Pro Ser Pro Glu 
865 670 875 880 

Phe Phe Thr Glu Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro Pro 

885 690 895 

Cys Lys Pro Leu Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu His 

900 905 910 

Glu Tyr Pro Val Gly Ser Gin Leu Pro Cys Glu Pro Glu Pro Asp Val 
915 920 925 

Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His He Thr Ala Glu 
930 935 940 

Ala Ala Gly Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser Val Ala Ser 
945 950 955 960 

Ser Ser Ala Ser Gin Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr 
965 970 975 

Ala Asn His Asp Ser Pro Asp Ala Glu Leu He Glu Ala Asn Leu Leu 
980 965 990 

Trp Arg Gin Glu Met Gly Gly Asn He Thr Arg Val Glu Ser Glu Asn 
995 1000 1005 

Lys Val Val He Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp 
1010 1015 1020 

Glu Arg Glu He Ser Val Pro Ala Glu He Leu Arg Lys Ser Arg Arg 
1025 1030 1035 1040 

Phe Ala Gin Ala Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro 
1045 1050 1055 

Leu Val Glu Thr Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val His 
1060 1065 1070 

Gly Cys Pro Leu Pro Pro Pro Lys Ser Pro Pro Val Pro Pro Pro Arg 
1075 1080 1085 

Lys Lys Arg Thr Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala Leu 
1090 1095 1100 

Ala Glu Leu Ala Thr Arg Ser Phe Gly Ser Ser Ser Thr Ser Gly He 
1105 1110 1115 1120 

Thr Gly Asp Asn Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys 
1125 1130 1135 

Pro Pro Asp Ser Asp Ala Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu 
1140 1145 1150 

Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val 
1155 1160 1165 

Ser Ser Glu Ala Asn Ala Glu Asp Val Val Cys Cys Ser Met Ser Tyr 
1170 1175 1160 
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Ser Trp Thr Gly Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gin Lys 
1185 1190 1195 1200 

Leu Pro He Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Leu 
1205 1210 1215 

Val Tyr Ser Thr Thr Ser Arg Ser Ala Cys Gin Arg Gin Lys Lys Val 
1220 1225 1230 

Thr Phe Asp Arg Leu Gin Val Leu Asp Ser His Tyr Gin Asp Val Leu 
1235 1240 1245 

Lys Glu Val Lys Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu Ser 
1250 1255 1260 

Val Glu Glu Ala Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser Lys 
1265 1270 1275 1280 

Phe Gly Tyr Gly Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala Val 
1285 1290 1295 

Thr His He Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val Thr 
1300 1305 1310 

Pro He Asp Thr Thr He Met Ala Lys Asn Glu Val Phe Cys Val Gin 
1315 1320 1325 

Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu He Val Phe Pro Asp 
1330 1335 1340 

Leu Gly val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Thr 
1345 1350 1355 1360 

Lys Leu Pro Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gin Tyr Ser 
1365 1370 1375 

pro Gly Gin Arg Val Glu Phe Leu Val Gin Ala Trp Lys Ser Lys Lys 
1380 1385 1390 

Thr Pro Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val 
1395 1400 1405 

Thr Glu Ser Asp He Arg Thr Glu Glu Ala He Tyr Gin Cys Cys Asp 
1410 1415 1420 

Leu Asp Pro Gin Ala Arg Val Ala He Lys Ser Leu Thr Glu Arg Leu 
1425 1430 1435 1440 

Tyr Val Gly Gly Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr 
1445 1450 1455 

Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr 
1460 1465 1470 

Leu Thr Cys Tyr He Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu 
1475 1480 1485 

Gin Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val Val He Cys 
1490 1495 1500 

Glu Ser Ala Gly Val Gin Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr 
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1505 1510 1515 1520 

Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gin Pro 
1525 1530 1535 

Glu Tyr Asp Leu Glu Leu lie Thr Ser Cys Ser Ser Asn Val Ser Val 
1 1540 1545 1550 

Ala His Asp Gly Ala Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro 
1555 1560 1565 

Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro 
1570 1575 1580 

Val Asn Ser Trp Leu Gly Asn He He Met Phe Ala Pro Thr Leu Trp 
1585 1590 1595 1600 

Ala Arg Met He Leu Met Thr His Phe Phe Ser Val Leu He Ala Arg 
1605 1610 1615 

Asp Gin Leu Glu Gin Ala Leu Asp Cys Glu He Tyr Gly Ala Cys Tyr 
1620 . 1625 1630 

Ser He Glu Pro Leu Asp Leu Pro Pro He He Gin Arg Leu His Gly 
1635 1640 1645 

Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro Gly Glu He Asn Arg 
1650 1655 1660 

Val Ala Ala Cys Leu Arg Lys Leu Gly Val Pro Pro Leu Arg Ala Trp 
1665 1670 1675 1680 

Arg His Arg Ala Arg Ser Val Arg Ala Arg Leu Leu Ala Arg Gly Gly 
1685 1690 1695 

Arg Ala Ala He Cys Gly Lys Tyr Leu Phe Asn Trp Ala Val Arg Thr 
1700 1705 1710 

Lys Leu Lys Leu Thr Pro He Ala Ala Ala Gly Gin Leu Asp Leu Ser 
1715 1720 1725 

Gly Trp Phe Thr Ala Gly Tyr Ser Gly Gly Asp He Tyr His Ser Val 
1730 1735 1740 

Ser His Ala Arg Pro Arg Trp He Trp Phe Cys Leu Leu Leu Leu Ala 
1745 1750 1755 1760 

Ala Gly Val Gly He Tyr Leu Leu Pro Asn Arg Met Ser Thr Asn Pro 
1765 1770 1775 

Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Gin Asp 
1780 1785 1790 

Val Lys Phe Pro Gly Gly Gly Gin He Val Gly Gly Val Tyr Leu Leu 
1795 1800 1805 

Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys Thr Ser 
1810 1815 1820 

Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro He Pro Lys Ala Arg 
1825 1830 1835 1840 
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Arg Pro Glu Gly Arg Thr Trp Ala Gin Pro Gly Tyr Pro Trp Pro Leu 
1B45 1850 1855 

Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg 
1860 1865 1B70 

Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg Ser Arg 
1875 1880 1885 

Asn Leu Gly Lys 
1890 



<210> 9 
<211> 546 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of T^tlficial Sequence: representative 
native NS3 protease domain 

<220> 

<221> CDS 

<222> (1) . . (546) 

<400> 9 

atg gcg ccc ate acg gcg tac gcc cag cag aca agg ggc etc eta ggg 48 
Met Ala Pro He Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly 
15 10 15 

tge ata ate ace age eta act ggc egg gac aaa aac caa gtg gag ggt 96 
Cys He He Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu Gly 
20 25 ' 30 

gag gtc cag att gtg tea act get gcc caa acc ttc ctg gca acg tge 144 
Glu Val Gin He Val Ser Thr Ala Ala Gin Thr Phe Leu Ala Thr Cys 
35 40 45 

ate aat ggg gtg tge tgg act gtc tac cac ggg gcc gga acg agg acc 192 
He Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr 
50 55 60 

ate gcg tea eee aag ggt eet gtc ate cag atg tat ace. aat gta gac 240 
He Ala Ser Pro Lys Gly Pro Val He Gin Met Tyr Thr Asn Val Asp 
65 70 75 80 

caa gac ett gtg ggc tgg ccc get ceg caa ggt age ega tea ttg aca 288 
Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly Ser Arg Ser Leu Thr 
85 • 90 95 



ccc tge act tge ggc tec teg gac ett tac ctg gtc acg agg cac gee 336 
Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala 
100 105 110 

gat gtc att eee gtg cgc egg egg ggt gat age agg ggc age ctg ctg 384 
Asp Val He Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu 
115 120 125 

teg ccc egg ccc att tec tac ttg aaa ggc tec teg ggg ggt ceg ctg 432 
Ser Pro Arg Pro He Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu 
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130 135 140 

ttg tgc ccc gcg ggg cac gcc gtg ggc ata ttt agg gcc gcg gtg tgc 480 
Leu Cys Pro Ala Gly His Ala Val Gly lie Phe Arg Ala Ala Val Cys 
145 150 155 160 

acc cgt gga gtg get aag gcg gtg gac ttt ate cct gtg gag aac eta 528 
Thr Arg Gly Val Ala Lys Ala Val Asp Phe lie Pro Val Glu Asn Leu 
165 170 175 

gag aca acc atg agg tec 546 
Glu Thr Thr Met Arg Ser 
180 



<210> 10 
<211> 182 . 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: representative 
native NS3 protease domain 

<400> 10 

Met Ala Pro lie Thr Ala Tyr Ala Gin Gin Thr Arg Gly Leu Leu Gly 
15 10 15 

Cys lie lie Thr Ser Leu Thr Gly Arg Asp Lys Asn Gin Val Glu Gly 
20 25 30 

Glu Val Gin lie Val Ser Thr Ala Ala Gin Thr Phe Leu Ala Thr Cys 
35 40 45 

lie Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr 
50 55 60 

lie Ala Ser Pro Lys Gly Pro Val lie Gin Met Tyr Thr Asn Val Asp 
65 70 75 80 

Gin Asp Leu Val Gly Trp Pro Ala Pro Gin Gly Ser Arg Ser Leu Thr 
85 90 95 

Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala 
100 105 110 

Asp Val lie Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu 
115 » 120 125 

Ser Pro Arg Pro lie Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu 
130 135 140 

Leu Cys Pro Ala Gly His Ala Val Gly lie Phe Arg Ala Ala Val Cys 
145 150 155 160 

Thr Arg Gly Val Ala Lys Ala Val Asp Phe He Pro Val Glu Asn Leu 
165 170 175 

Glu Thr Thr Met Arg Ser 
180 
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